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RANDOM ENRICHED TREES WITH APPLICATIONS TO RANDOM 

GRAPHS 

BENEDIKT STUFLER 


Abstract. We establish limit theorems that describe the asymptotic local and global geometric 
behaviour of random enriched trees considered up to symmetry. We apply these general results 
to random unlabelled weighted rooted graphs and uniform random unlabelled fe-trees that are 
rooted at a fc-clique of distinguishable vertices. For both models we establish a Gromov-Hausdorff 
scaling limit, a Benjamini-Schramm limit, and a local weak limit that describes the asymptotic 
shape near the hxed root. 


1. Introduction 

The study of large random discrete structures lies at the intersection of probability theory and 
combinatorics. A combinatorial approach often involves using the framework of combinatorial 
classes to express the quantities under consideration in terms of coefficients of power series, 
and applying analytic tools such as singularity analysis or saddle-point methods to obtain very 
precise limits and concentration results [26, 31, 9, 19, 37, 21]. From a probabilistic viewpoint, 
the focus is on establishing graph limits describing the asymptotic shape, either locally in so 
called local weak limits [7, 63, 22, 20, 13, 56, 15], or globally in Gromov-Hausdorff scaling 
limits [43, 40, 2, 49, 51, 50, 57], and more recently, also on an intermediate scale in local 
Gromov-Hausdorff scaling limits [10, 69]. 

In this context, many of the objects under consideration such as graphs and planar maps are 
endowed with an operation of the symmetric group, and it is natural to study the corresponding 
unlabelled objects, that is, the orbits under this group action as representatives of objects 
considered up to symmetry. For some types of planar maps this is not a particularly interesting 
endeavour, as their study may often be reduced to half-edge rooted maps which admit only 
trivial symmetries. On the other hand, a variety of discrete structures such as graph classes 
with constraints exhibit a highly complex and interesting symmetric structure. For example, the 
precise asymptotic number of random labelled planar graphs has been obtained roughly a decade 
ago in the celebrated work by Gimenez and Noy [36], but the asymptotic number of unlabelled 
planar graphs is still unknown and obtaining it is surely one of the central contemporary problems 
in enumerative combinatorics. 

The study of objects considered up to symmetry poses a particular challenge. Probabilistic 
approaches in the past treat models of random unordered trees [59, 65, 55, 39, 68]. Gombinatorial 
results on more complex structures were obtained for example by Bodirsky, Fusy and Kang [16] 
and Kraus [45] for models of unlabelled outerplanar graph, by Drmota, Fusy, Kang, Kraus and 
Rue [27] for so called families of subcritical classes of unlabelled graphs, and Drmota and Jin [28] 
and Gainer-Dewar, Gessel and Ira [35] for unlabelled A;-dimensional trees. 

In the present work, we obtain probabilistic limits for a large family of random combinatorial 
objects considered up to symmetry, including random unlabelled rooted graphs that are sampled 
according to weights on their 2-connected components, and random front-rooted unlabelled 
A:-dimensional trees. Rather than treating each model individually, we take a unified approach 
and establish a set of limit theorems that apply to the abstract family of random unlabelled 
7^-enriched trees, with the class TZ ranging over all combinatorial classes. The concept of enriched 
trees goes back to Labelle [47]. Roughly speaking, given a class TZ of combinatorial objects, 
an 7^-enriched tree is a rooted tree together with a function that assigns to each vertex an 
7^-structure on its offspring. The model we consider is an unlabelled 7^-enriched tree with n 
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vertices considered up to symmetry, that we sample with probability proportional to a weight 
formed by taking the product of weights assigned to its local 7^-structures. The limits are formed 
as n becomes large, possibly along a shifted sublattice of the integers. Of course it also makes 
sense to study random labelled pendants of enriched trees, and this endeavour is undertaken 
in [64], 

Recall that a symmetry may be defined as a combinatorial structure together with an automor¬ 
phism. Our approach uses an encoding of symmetries of an 7^-enriched tree by a Sym(7^)-enriched 
plane tree, that is, a plane tree where each vertex is endowed with a local 7^-symmetry. We 
construct two infinite limit objects in terms of random trees enriched with local symmetries, and 
establish weak limits that describe the asymptotic behaviour of the o(\/n)-vicinity of the root 
vertex of the random enriched tree, and the o(\/n)-vicinity near a uniformly at random selected 
node. For the latter, some inspiration was taken by Aldous’ approach [3] on asymptotic fringe 
distributions. In order to study global geometric properties, we define metric spaces based on 
random unlabelled enriched trees that are patched together from a random cover by small spaces. 
Using a size-biased Sym(7^)-enriched tree that is strongly related to the local weak limit, we 
study the asymptotic global metric structure as the number of points of this model of random 
metric spaces becomes large, resulting in a Gromov-Hausdorff scaling limit. 

In order to illustrate the scope of our results, we provide applications to specific models of 
random unlabelled graphs. The first model considered is that of random unlabelled rooted 
connected graphs sampled with probability proportional to a product of weights assigned to 
their 2-connected components. For this model, we obtain a local weak limit that describes the 
asymptotic vicinity near the fixed root, a Benjamini-Schramm limit that describes the asymptotic 
shape near a random vertex, and a Gromov-Hausdorff scaling limit. Moreover, we obtain sharp 
tail bounds for the diameter. In the two local limits, we even obtain total variational convergence 
of arbitrary o(\/n)-sized neighbourhoods of the fixed root and random root, which is best-possible 
as the convergence fails for neighbourhoods whose radius is comparable to ^/n. The setting we 
consider explicitly includes uniform random unlabelled rooted graphs from so called subcritical 
graph classes introduced in [27], such as series-parallel graphs, outerplanar graphs, and cacti 
graphs. As for extremal graph parameters, our results also establish the correct order of the 
diameter. The maximum degree and largest 2-connected component are shown to have typically 
order O(logn). In [27] additive parameters of these graphs such as the degree distribution were 
studied using analytic methods. The two local limits add a probabilistic interpretation to the 
limit degree distributions obtained in [27] for the degree of a random vertex and of the fixed 
root. Furthermore, general results by Kurauskas [46, Thm. 2.1]) and Lyons [54, Thm. 3.2] for 
Benjamini-Schramm convergent sequences of random graphs may be applied to deduce laws of 
large numbers for subgraph count asymptotics and spanning tree count asymptotics in terms of 
the BenjaminUSchramm limit. 

Our general results also apply to random unlabelled k-tiees that are rooted at a front of 
distinguishable vertices. A fc-tree consists either of a complete graph with k vertices, or is 
obtained from a smaller A:-tree by adding a vertex and connecting it with k distinct vertices 
of the smaller A:-tree. Such objects are interesting from a combinatorial point of view, as their 
enumeration problem has a long history, see [58, 34, 33, 32, 23, 11, 41]. They are also interesting 
from an algorithmic point of view, as many NP-hard problems on graphs have polynomial 
algorithms when restricted to fc-trees [8, 38]. Employing recent results for limits of random 
unlabelled Gibbs partitions [66], we obtain a local weak limit for unlabelled front-rooted k-trees 
that describes the total variational asymptotic behaviour of arbitrary o(-yn)-neighbourhoods 
of the root-front. We also obtain a Benjamini-Schramm limit describing the asymptotic shape 
of o(-y/n)-neighbourhoods of a uniformly at random selected vertex. Furthermore, we obtain 
a Gromov-Hausdorff scaling limit. For all three limits, a concentration result is required that 
relates the distances of certain points with respect to the /c-tree metric and to a tree-metric in the 
underlying representation by trees endowed with local symmetries. We obtain this intermediate 
result by locating a hidden Markov chain and applying a large deviation inequality by Lezaud [53] 
for functions on non-reversible Markov processes. 
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As a third application, our results also yield local weak limits, Benjamini-Schramm limits and 
scaling limits for a family of random unordered trees drawn according to weights assigned to the 
vertex degrees. 

Plan of the paper. Section 1 gives an informal description of the topic and main applications. 
Section 2 fixes notation related to graphs, trees and fc-trees, and recalls necessary background 
on local weak convergence, Gromov-Hausdorff convergence and further properties of large 
critical Galton-Watson trees. Section 3 is a concise introduction to the combinatorial framework 
of species of structures and Boltzmann distributions, with a focus on the decomposition of 
symmetries. Section 4 discusses a limit theorem for unlabelled Gibbs partitions, that we are 
going to use in our applications. Section 5 explicitly states some probabilistic and combinatorial 
tools related to random walks and Markov chains. Section 6 presents the contributions of the 
present paper in detail. Specifically, Subsection 6.1 introduce the model of unlabelled 7^-enriched 
trees under consideration, and discusses combinatorial bijections that show how this generalizes 
various models of random graphs considered up to symmetry, in particular unlabelled block- 
weighted rooted graphs and unlabelled front-rooted A;-trees. Subsection 6.2 builds the framework 
regarding the local weak limits of unlabelled enriched trees with respect to the fixed root vertex 
and with respect to a randomly selected point. Subsection 6.3 introduces a general model of 
random metric spaces based on random unlabelled enriched trees, and establishes a scaling limit 
and sharp diameter tail-bounds. Subsection 6.4 presents our applications to random weighed 
unlabelled connected rooted graphs, in particular a scaling limit with respect to the first-passage 
percolation metric, sharp diameter tail-bounds, a local weak limit and a Benjamini-Schramm 
limit. Subsection 6.5 discusses applications to random unlabelled front-rooted fc-dimensional 
trees, for which a scaling limit, a local weak limit and a Benjamini-Schramm limit are established. 
Subsection 6.6 presents further applications to a family of simply generated unlabelled unrooted 
trees. In Section 7 we collect all proofs. 

2. Preliminaries 


2.1. Notation. Throughout, we set 

N = {1,2,...}, No = {0}UN, [n] = {1,2,... ,n}, n G Nq. 

The set of non-negative real numbers is denoted by M>o. We usually assume that all considered 
random variables are defined on a common probability space whose measure we denote by P, and 
let Lp denote the corresponding space of p-integrable real-valued functions. All unspecified limits 
are taken as n becomes large, possibly taking only values in a subset of the natural numbers. 

We write —^ and —^ for convergence in distribution and probability, and = for equality in 
distribution. An event holds with high probability, if its probability tends to 1 as n tends to 
infinity.. We let Op(l) denote an unspecified random variable of a stochastically bounded 
sequence (Xn)n, and write Op(l) for a random variable A„ with -^0. We write jC(X) to 
denote the law of a random variable X. The total variation distance of measures and random 
variables is denoted by cItv- Given a power series f{z), we let [z"']f{z) denote the coefficient of 
z” in f{z). 

2.2. Graphs, trees and k-trees. We are going to consider simple graphs, that have no loops 
or parallel edges. The vertices that are adjacent to a vertex u in a graph G are its neighbourhood. 
The cardinality of its neighbourhood is called the degree of the vertex v, and denoted by dciv). 
A graph is termed connected, if any two vertices may be joined by a path. More generally, for 
k > 1 we say a graph G is /c-connected, if it has at least k + 1 vertices and deleting any k — I 
of them does not disconnect the graph. A cutvertex is a vertex whose removal disconnects the 
graph. Hence 2-connected graphs are graphs without cutvertices and size at least three. 

A graph isomorphism between graphs G and 77 is a bijection between their vertex sets such 
any two vertices in G are joined by an edge if and only if their images in H are. In this case we 
say the two graphs are isomorphic. We say a graph is rooted, if one of its vertices is distinguished. 
Graph isomorphisms between rooted graphs are required to map the roots to each other. More 
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generally, we may consider graphs with an ordered number of distinguishable root-vertices, that 
must be respected by graph isomorphisms. A graph considered up to isomorphism is an unlabelled 
graph. That is, any two unlabelled graphs are distinct if they are not isomorphic. Formally, 
unlabelled graphs are defined as isomorphism classes of graphs. Unlabelled rooted graphs are 
defined analogously. 

A tree is a graph that is connected and does not contain circles. In a rooted tree, we say 
the vertices lying on the path between a vertex v and the root are the ancestors of v. The 
vertices that are joined to v by an edge but are not ancestors are its offspring or sons. The 
collection of the sons of a vertex is its offspring set. The cardinality of the offspring set of a 
vertex u in a rooted tree A is its outdegree and denoted by d\{v). Unlabelled rooted trees are 
also termed Pdlya-irees, in honour of George Polya. 

The complete graph with n vertices is denoted by Kn. That is, in any two distinct vertices 
are connected. A subgraph of a graph is termed an n-clique, if its isomorphic to Kn. A k-tree is a 
graph that may be constructed by starting with a A:-clique, and adding in each step a new vertex 
that gets connected with k arbitrarily chosen distinct vertices of the previously constructed graph. 
The fe-cliques of a A:-tree are also called its fronts, and the k -|- 1-cliques its hedra. In the present 
work, we are only considering /c-trees that are rooted at a front of distinguishable vertices. 

A block B of a graph G is a subgraph that is inclusion maximal with the property of being either 
an isolated vertex, a 2-clique, or 2-connected. Connected graphs have a tree-like block-structure, 
whose details are explicitly given for example in Diestel’s book [25, Ch. 3.1]. We mention a few 
properties, that we are going to use. Any two blocks of G overlap in at most one vertex. The 
cutvertices of G are precisely the vertices that belong to more than one block. 

Any connected graph G is naturally equipped with the graph-metric on its vertex set, that 
assigns to any two vertices the minimum number of edges required to join them. We usually 
denote by dcf, •)• Given a vertex u G C and an integer fc > 0, the k-neighbourhood I4(G, v) is 
the subgraph induced by all vertices with distance at most k from v. We regard Vk{C,v) as 
rooted at the vertex v. The diameter D(C) is the supremum of all distances between pairs of 
vertices. For a rooted graph G*, we may also consider its height H(C'*), which is the supremum of 
all distances of vertices from the root of G*. Given a vertex v, we let hc»(u) denote its distance 
from the root. 

Another metric on C is the block-metric daLocK- The block-distance between any two vertices 
of G is given by the minimum number of blocks required to cover any joining path. By standard 
properties of the block-structure of connected graphs, the choice of the joining path does not 
matter. For any vertex v € G and any integer k > 0 we let Uk{G,v) denote the k-block- 
neighbourhood, that is, the subgraph induced by all vertices with block-distance at most k. We 
regard Uk{G, v) as rooted at the vertex v. 

2.3. Local weak convergence. Let G* = (G, vg) and H* = {H, vh) be two connected, rooted, 
and locally finite graphs. We may consider the distance 

d{G’,H') = 

with Vk{G*) ~ Vk{H*) denoting isomorphism of rooted graphs. This defines a premetric on 
the collection of all rooted locally finite connected graphs. Two such graphs have distance 
zero, if and only if they are isomorphic. Hence this defines a metric on the collection B of all 
unlabelled, connected, rooted, locally hnite graphs. The space (B,dBs) is known to be complete 
and separable, that is, a Polish space. 

A random rooted graph G* G B is the the local weak limit of a sequence G* = (G„, Vn), n G N 
of random elements of B, if it is the weak limit with respect to this metric. That is, if 

lim E[/(G:)] = E[/(G-)] 

n^oo 

for any bounded continuous function / : B —>■ M. This is equivalent to stating 

lim IP(Ufc(G;) ~ G’) = IP(Ufc(G’) ~ G’). 

for any rooted graph G*. If the conditional distribution of Vn given the graph G„ is uniform on 
the vertex set U(G„), then the limit G* is often also called the Benjamini-Schramm limit of the 
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sequence (G„)„. This kind of convergence is often yields laws of large numbers for additive graph 
parameters. 

2.4. Gromov—HausdorfF convergence. Let X* = {X,dx,XQ) and Y* = (Y,dY,yo) denote 
pointed compact metric spaces. A correspondence between X* and Y* is a subset R C X x Y 
containing (xq, Uo) such that for any x G X there is a point y G Y with (x, y) G R, and conversely 
for any y G Y there is a point x G X with (x, y) G R. The distortion of the correspondence is 
defined as the supremum 

dis{R) = sup{|dx(a:i,X 2 ) - dy(yi,y 2 ) | (x 2 ,y 2 ) G 

The (pointed) Gromov-Hausdorff distance between the pointed spaces X* and Y* may be defined 
by 

dcniX, h") = :^ inf dis{R) 

Z R 

with the index R ranging over all correspondences between X* and Y*. The factor 1/2 is only 
required in order to stay consistent with an alternative definition of the Gromov-Hausdorff 
distance via the Hausdorff distance of embeddings of X* and Y* into common metric spaces, 
see [52, Prop. 3.6] and [18, Thm. 7.3.25]. This distance satisfies the axioms of a premetric on 
the collection of all compact rooted metric spaces. Two such spaces have distance zero from 
each other, if and only if they are isometric. That is, if there is a distance preserving bijection 
between the two that also preserves the root vertices. Hence this yields a metric on the collection 
K* of isometry classes of pointed compact metric spaces. The space (K*, don) is known to be 
Polish (complete and separable), see [52, Thm. 3.5] and [18, Thm. 7.3.30 and 7.4.15]. 

2.5. Large critical Galton—Watson trees. In this section we let Tn denote a Galton-Watson 
tree conditioned on having n vertices, such that offspring distribution ^ has average value E[^] = 1 
and finite non-zero variance a^. 

2.5.1. Convergence toward the CRT. The (Brownian) continunm random tree (CRT) is a random 
metric space constrncted by Aldous in his pioneering papers [4, 5, 6]. Its construction is as follows. 
To any continuous function / : [0,1] —>• [0,oo[ satisfying /(O) = /(I) = 0 we may associate a 
premetric d on the unit interval [0,1] given by 

d{u,v) = f{u) + f{v) -2 inf /(s) 

u<s<v 

for u < V. The corresponding quotient space (7/,dT/) = ([0, l]/~,d), in which points with 
distance zero from each other are identified, is considered as rooted at the coset 0 of the point 
zero. This pointed metric space is an M-tree, see [30, 52] for the definition of M-trees and further 
details. The CRT may be defined as the random pointed metric space {Te,d%,0) corresponding 
to Brownian excursion e = (ej)o<t<i of duration one. 

The famous invariance principle, 

(2.1) {Tn, \n-^l‘'dT^,th) A (Te, dre, 0) 

in the Gromov-Hausdorff sense, is due to Aldous [6] and there exist various extensions, see for 
example Duquesne [29], Duquesne and Le Gall [30], Haas and Miermont [39]. 

2.5.2. Tail-bounds for the height and level width. Addario-Berry, Devroye and Janson [1, Thm. 
1.2] showed that there are constants C, c > 0 such that for all n and h >0 

(2.2) P(H(7;) > h)) < Cexp{-ch‘^/n). 

A corresponding left-tail upper bound of the form 

(2.3) IP(H(7;) <h)< Cexp(-c(n - 2)/h^) 

for all n and h > 0 is given in [1, p. 6]. The first moment of the number Lk{Tn) of all vertices v 
with height hj-^{v) = k admits a bound of the form 

(2.4) E[Lfc(7^)] < Cfeexp(—c/c^/n). 
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for all n and A; > 1. See [1, Thm. 1.5]. 

3. Combinatorial species and weighted Boltzmann distributions 

In order to study combinatorial objects up to symmetry, it is convenient to use the language 
of combinatorial species developed by Joyal [44]. It provides a clean and powerful framework in 
which complex combinatorial bijection may be stated using simple algebraic terms. In order to 
make the present work accessible to a large audience, we recall the notions and results required to 
state and prove our main results. The theory admits an elegant and concise description using the 
language of category theory, but we will avoid this terminology and assume no knowledge by the 
reader in this regard. The exposition of the combinatorial and algebraic aspects in the following 
subsections follows mainly Joyal [44] and Bergeron, Labelle and Leroux [14]. The probabilistic 
aspects in the Boltzmann sampling framework is based on a recent work by Bodirsky, Fusy, Kang 
and Vigerske [17, Prop. 38]. 

3.1. Weighted combinatorial species. A combinatorial species T" is a functor or ’’rule” that 
produces for each finite set U a hnite set F\U] of T'-objects and for each bijection a : U ^ V a 
bijective map T’[cr] : iF[U] —>■ F\y\ such that the following properties hold. 

1) F preserves identity maps, that is for any finite set U it holds that 

-^[idu] = id^[;7] • 

2) F preserves composition of maps, that is, for any bijections of finite sets a ■. U ^ V and 
fj' : K — 7 - kF we require that 

F[a'a] =F[a']F[a]. 

The idea behind this is that finite combinatorial objects are composed out of atoms, and relabelling 
this atoms yields structurally equivalent objects. 

We say a combinatorial species F maps any finite set U of labels to the finite set F[U] of 
F-objects and any bijection a :U ^ V to the transport function F[a]. For any two T'-objects 
Fu G F\U] and Fy G F\y] that satisfy F[a\{Fu) = Fy, we say Fjj and Fy are isomorphic and 
fj is an isomorphism between them. The object Ffj has size \Fij\ = \U\ and U is its underlying 
set. An unlabelled F-object or isomorphism type is an isomorphism class of T'-objects. That is, a 
maximal collection of pairwise isomorphic objects. By abuse of notation, we treat unlabelled 
objects as if they were regular objects. 

An M.>Q-weighted species F^ consists of a species F and a weighting u that produces for any 
finite set U a map 

iou : F[U] —)• M>o 

such that uju = ojy o F[a] for any bijection a : U ^ V. Any object F G F[U] has weight ijJu{F). 
By abuse of notation we will often drop the index and write oj{F) instead of uju{F). Isomorphic 
structures have the same weight, hence we may define the weight of an unlabelled T'-object to 
be the weight of any representative. The inventory jT'[n]jt,; is defined as the sum of weights of all 
unlabelled T'-objects of size n. Any species may be considered as a weighted species by assigning 
weight 1 to each structure, and in this case the inventory counts the number of T'-objects. If we 
do not specify any weighting for a species, then we assume that it is equipped with this canonical 
weighting. 

To any weighted species F^ we may associate its ordinary generating series 

n>0 F unlabelled ^-object 

We may form the species Sym(T') of F-symmetries by letting Sym(T')[C/] be the set of all pairs 
(T, a) with F G F\U\ an T'-structure and a an automorphism of F, that is, a bijection a : U ^ U 
with T'[cj](T’) = F. For any bijection 7 : ?7 —)• K the corresponding transport function Sym(T')[ 7 ] 
maps to a symmetry {F,a) G Sym(T’)[?7] to the symmetry (T'[ 7 ](F’), 70 - 7 “^) in Sym(T')[K]. 

There is a canonical weighting on Sym(T') with weights in the power series ring M[[si, S 2 , •. •]]• 
By abuse of notation, we also denote this weighting by w. It is given by 

co{F,a)=ioiF)sfhs^^^--- 
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with Gi denoting the number of cycles of length i of the permutation a. Here we count fixpoints 
as 1-cycles. The cycle index sum of J-^ may be defined is defined by 

^ ^ a;(F,a)/n! G M[[si,S2, ...]]. 

n>0(F,(7)eSym(J')[n] 

The generating series are related by 

(3.1) = 

See for example Chapter 2.3 in the book by Bergeron, Labelle and Leroux [14]. The main point 
of considering symmetries is the following. 

Lemma 3.1 (Number of symmetries). For each unlabelled F-object s with size n there are 
precisely n! symmetries {F,cr) G Sym{F)[n\ such that the isomorphism type t{F) of the F-object 
F is equal to s. 

This follows from basic properties of group operations, see for example Joyal [44, Sec. 3]. In 
particular, if we draw a symmetry (F,cj) from the set Sym(T')[n] at random with probability 
proportional to its weight, then 

(3.2) F{tif) = t)=u;{t)/ ^ .;(.) 

ssT[n] 


for any unlabelled T'-object t of size n. 

We say that two species F and Q are isomorphic, denoted by T" ~ if there is a family 
{au)u of bijections ajj : F[U] —)• G[U], with the index U ranging over all finite sets, such that 
the following diagram commutes for any bijection a : U ^ V of finite sets. 


F 


u]—\f 


O-U 


y\ 


a.y 


Q[u]^^G[v] 


The family {au)u is then termed a species isomorphism from F to G- 

Two weighted species F^^ and G'^ are called isomorphic, if there exists a species isomorphism 
{au)u from F to G that preserves the weights, that is, with v{au{F)) = oj{F) for each finite 
set U and J-'-object F G F\U]. In this case the cycle index sums and hence also the other two 
generating series of F^ and G'^ coincide. 

There are some natural examples of species that we are going to encounter frequently. The 
species SET with SET[[/] = {U} has only one structure of each size and its cycle index sum is 
given by 

■^SEt('Si, S2, • ■ ■) = exp(^ Si/i). 

i>l 

The species SEQ of linear orders assigns to each hnite set U the set SEQ[C/] of tuples (ui,..., ut) 
of distinct elements with U = {ui,... ,ut}. Its cycle index sum is given by 

■^SEq('Si, S2, ...) = !/(!- si). 

The species X is given by X[U] = 0 if |C/| 7 ^ 1 and X[U] = {U} if 17 is a singleton. The species 
0 is given by 0[[/] = 0 for all U, and the species 1 by 1[0] = {0} and 1[U] = 0 for all finite 
non-empty sets U. 


3.2. Operations on species. Species may be combined in several ways to form new species. 
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3.2.1. Products. The product T - Q oi two species T and Q is the species given by 

{T-QW\= □ m]xa[C/2] 

{Ul,U2) 

with the index ranging over all ordered 2-partitions of U, that is, ordered pairs of (possibly 
empty) disjoint sets whose union equals U. The transport of the product along a bijection is 
defined componentwise. Given weightings a; on 7^ and on there is a canonical weighting on 
the product given by 

fiiF,G)=u;{FMG). 

This defines the product of weighted species 

{F ■ gy = ■ O'". 

The corresponding cycle index sum satisfies 

'^(T-QY = . 

We also define the powers of a species by 

..... 

with i factors in total, and define (F‘^)^ = 0 to be the empty species having no objects at all. 


3.2.2. Sums. Let be a family of species such that for any finite set U only finitely many 

indices i with Fi\U] / 0 exist. Then the sum is a species defined by 

(j; J-,)[C/] = □ 

iei iei 

Given weightings oji on Fi, there is a canonical weighting jj, on the sum given by 


y{F)=uj,{F) 


for any i and F G Fi[U]. This defines the sum of the weighted species 


ie/ iei 


The corresponding cycle index sum is given by 


Z. 


E,-^r 




Fp- 


3.2.3. Derived species. Given a species F, the corresponding derived species F' is given by 

F'[U]=F[UY{*u}] 

with *ij referring to an arbitrary fixed element not contained in the set U. (For example, we could 
set *if = {[/}.) Any weighting co on F may also be viewed as a weighting on F' , by letting the 
weight of a derived object F G F'[U] be given by a;[/u{*j^}(F). The transport along a bijection 
fj :[/—)■ G is done by applying the transport F[a'] of the bijection a' : U Y {*f/} -F V U {*y} 
with a'\U = a. The cycle-index sum of the weighted derived species (F^)' is satisfies 

By abuse of notation, we are often going to drop the index and just refer to the >i=-atom. 
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3.2.4. Pointing. For any species J- we may form the pointed species It is given by the product 
of species 

T' = X-T' 

with X denoting the species consisting of single object of size 1. In other words, an J^*-object is 
pair {m,v) of an J^-object m and a distinguished label v which we call the root of the object. 
Any weighting ui on X may also be considered as a weighting on X*, by letting the weight of 
{m,v) be given by oj{m). This choice of weighting is consistent with the natural weighting given 
by the product and derivation operation X ■ X'^ if we assign weight 1 to the unique object of X. 
The corresponding cycle index sum is consequently given by 

3.2.5. Substitution. Given species X and G with ^[0] = 0, we may form the composition o ^ as 
the species with object sets 


(^og)[[/]=U Wx.T[7r]x llg[Q] 


QGtt 


with the index tt ranging over all unordered partitions of the set U. Here the transport {Xog)[a] 
along a bijection a : U ^ V is done as follows. For any object (GgjQgTr) in (-Fo G)[U] 

define the partition 

TT = {a{Q) \ Q e tt}, 

and let 


a : TT —)• TT 


denote the induced bijection between the partitions. Then set 


(J^og)[o-](7r,T, (Gq)q6^) = {^,X[a]{F),{g[a\Q\{gQ))„(^Q)^^). 

That is, the transport along the induced bijection of partitions gets applied to the J^-object and 
the transports along the restrictions (t|q, Q G tt get applied to the ^-objects. Often, we are 
going to write X{g) instead of Given a weighting a; on and a weighting p on there is 

a canonical weighting /x on the composition given by 

^(7r,F,(GQ)Q6^)=a;(F)nKQ)- 

QStt 

This defines the composition of weighted species 


{Xogy = x^ ogr 


The corresponding cycle index sum is given by 


(3.3) Z , 52 , . . .) Zj^L^ i^Zgi' ('^1; ^2? • ■ •)’ (^2? *54; ••O’ ^ (^3? • • •)’ * ■ *)* 

Here X denotes the weighting with (zz*)(G) = n{Gy for all ^-structures G. 


3.2.6. Restriction. For any subset 12 C No we may restrict a weighted species X‘^ to objects 
whose size lies in 12 and denote the result by X^. 


3.2.7. Relations between the different operations. The interplay of the operations discussed in 
this section is described by a variety of natural isomorphisms. The two most important are the 
product rule and the chain rule, which we are going to use often. 


Proposition 3.2 ([44]). Let X^ and be weighted species. 

(1) There is a canonical choice for an isomorphism 

{X^ . g^y ~ .g^ + . {^^y_ 


(2) Suppose that ^[0] = 0. 


Then there is also a canonical isomorphism 

(x‘^og'^y c^{{x^yog'^).{g’'y. 
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Figure 1. Composition of cycles. 


We may easily verify the product rule, as the *-label in • Q'^)' may either belong the 
/"-structure, accounting for the summand (/'‘^)' • , or to the ^-structure, accounting for the 

second summand. The chain rule also has an intuitive explanation. The idea is that the partition 
class or ^-structure containing the *-label in an (/"^ o ^'^)'-structure distinguishes an atom of 
the /"-structure. Hence the (/"^ o ^^)'-structure consists of a o Q'^) composite structure, 

where all atoms of the /"-structure receive a regular ^-structure, except for a marked *-atom, to 
which we assign a derived ^-structure, which accounts for the extra factor ■ 

3.3. Symmetries of the substitution operation. We are going to need detailed information 
on the structure of the symmetries of the composition F o Q. The exposition of this section 
follows Joyal [44, Sec. 3.2] and Bergeron, Labelle and Leroux [14, Sec. 4.3]. We are going to 
discuss the following result. 

Lemma 3.3 (Parametrization of the symmetries of the substitution). Up to isomorphism of 
symmetries, any F o Q-symmetry may be constructed as described below from an F-symmetry 
{F,a) together with a family of G-symmetries {Gr,crr)T with the index t ranging over all cycles 
of the permutation a. 

There is much more to this result, as it lies at the heart of the proof of Equation (3.3). We 
refer the inclined reader to the mentioned literature for details. For the purpose of the present 
paper, it is sufficient to understand how the symmetry and in particular its cycles get assembled. 
We are going to use this later in order to dehne random unlabelled structures based on a tree-like 
decomposition of symmetries. 

The method of construction referred to in Lemma 3.3 is a bit involved, hence let us first recall 
what an /"o ^-symmetry is by definition. Let C/ be a finite set. Any element of Sym(/'o Q)\U] 
consists of the following objects: a partition vr of the set U, an /"-structure F G F[tt\, a family 
of ^-structures {Gq)q^t^ with Gq G G[Q] and a permutation a : U U. The permutation a is 
required to permute the partition classes and induce an automorphism 

u : vr —>■ TT, Q ^ (^{Q) 

of the /"-object F. Moreover, for any partition class Q € tt the restriction a\Q : Q ^ o'(Q) is 
required to be an isomorphism from Gq to G^(^q-) . 

Note that for any cycle r = {Qi, ■ ■ ■ ,Qe) of u, it follows that '■ Qi ^ Qi is an 

automorphism of Gq^ . Hence (Gr, cFt) '■= {Gq^ , ) is a ^-symmetry. The symmetry S together 

with the bijections 7 ^ := (t|q. for 1 < i < ^ — 1 already contain all information about the ^-objects 
Gq^,. .., Gq,, and the restriction cj|Q^u...uQf ■ Indeed, it holds that = ^[ 7 * • • • 7 i](G'r) for all 

1 < i < £ — 1 , hence we may reconstruct the ^-objects. The bijections 7 * contain all information 
about and it holds that (t|q^ = In particular, any fc-cycle (ai,..., a^) 

of the permutation cJt- corresponds to the fcGcycle 

(ai,(T(ai),... ,o-^“^(ai),a 2 ,fT(a 2 ),..., { 02 ),.. .,ak,cr{ak), ■ ■ .,a^~^{ak)) 

of the permutation o'|qju...uq«• 
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This implies that any T o ^-symmetry is isomorphic to a symmetry ((n, F, {GQ)Q^Tr),cr) 
constructed in the following way, which is illustrated in Figure 1. Start with choosing an 
T'-symmetry {m,am)- For any cycle r of the permutation am choose a ^-symmetry {Gr,ar) 
and let Qr denote its set of labels. For every atom e of the cycle r set Qe '■= Qt x {e} and 
{Gq^^ag^) := Sym(^) [/e] (Gr, CTr) with fe'■ Qt ^ Qe the canonical bijection. For any label e of 
the T'-structure m set /(e) := Qe and let vr denote the set of all sets Qe- Thus F := F[f]{m) is 
an T'-structure with label set tt and G := (vr, F, {Gq)q^.Q} is an T" o ^-structure. Let r be a cycle 
of am and V a cycle of ar- Choose an atom 6 of r and an atom a of p. Let i denote the length of 
T and k the length of v. Form the composed cycle c by 

((a, b),...,{a, (p(a), b),..., (p(a), /“^(6)),..., b),..., {iy^~^{a),T^~^{b))). 


Then the product a of all composed cycles is an automorphism of the F o ^-structure G. The 
composed cycles are pairwise disjoint, hence it does not matter in which order we take the 
product. Note that a does not depend on the choice of the a’s but different choices of the 6’s 
result in a different automorphism cj. More precisely, if for a given cycle r of am we choose r(5) 
instead of 6, then the resulting automorphism is given by the conjugation (id, T)(T(id, instead 
of a. But (id, r) is an automorphism of the F o ^-structure C, hence the resulting symmetry 
(C, (id, r)cr(id, t)“^) is isomorphic to {G,a). This implies that the isomorphism type of {G,a) 
does not depend on the choices of the a’s and 6’s. Fixing any canonical way of making these 
choices yields the construction of Lemma 3.3. 


3.4. Weighted Boltzmann distributions and samplers. Boltzmann distributions crop up 
in the study of local limit of random discrete structures and in the limit of convergent unlabelled 
Gibbs partitions. A Boltzmann sampler is a possibly recursive procedure involving random 
choices that generates a structure according a Boltzmann distribution. For example, a subcritical 
or critical Galton-Watson tree may be considered as a Boltzmann distributed plane tree. A 
recursive sampler in this setting is a procedure that draws the offspring of the root and then 
calls itself for each offspring vertex. 

3.4.1. Boltzmann distributions. Let F^^ be a weighted species. For any y > 0 satisfying 0 < 
^^{v) < oo, the corresponding Boltzmann distribution for unlabelled F-objects is given by 

(3.4) (F) = F'^(y)~^ijj{F)y'^^\ F an unlabelled T'-object. 

Given a sequence y = {yj)j^^ of non-negative parameters yj satisfying 0 < ZjTu,(y) < oo, the 
corresponding Polya-Boltzmann distribution is given by 

(3.5) Psym(.F)-,y(^,<^) = ^J-“(y)~^w(F)i^i—, (F, cj) G ^ Sym(T')[m] , 

with ai denoting the number of i-cycles of a permutation a. By Lemma 3.1 and Equation (3.1), 
the Boltzmann distribution for unlabelled objects may be considered as the marginal distribution 
of the F-object in special cases of the Polya-Boltzmann distribution. That is, the F-object of a 
Pgym(j-)<*^,(y,y2 y3^,,,)-distributed F-symmetry follows a ^-distribution. 

3.4.2. Boltzmann samplers. The following lemma allows us to construct Boltzmann distributed 
random variables in the unlabelled setting for the sum, product and composition of species. The 
results of this section have been established by Bodirsky, Fusy, Kang and Vigerske [17, Prop. 38] 
for species without weights, and the generalization to the weighted setting is straight-forward. 

Lemma 3.4 (Weighted Polya-Boltzmann samplers). 

(1) Let F^ and be weighted species, and let X and Y be independent random variables with 
distributions C{X) = ¥sym[r)‘.iand CiY) = ¥gym[g)‘.i- Then {X, Y) may he interpreted 
as an (F • Q)-symmetry over the set [|A|] U [|T|]. If a denotes a uniformly at random 
drawn bijection from this set to [|A| -|- |y|], then 

C{{F ■g)[a]{X,Y)) = Fsym(T.gr,y 
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(2) Let be a family of weighted species, and a family of independent random 

variables with distributions F such that Yli U ^ ^ I drawn 

at random with probability proportional to Zj^kIy), that is 

''K 

F{K = k) = (y)/ Z^. (y), 

i 

then 

C{Xk) = ^Sym(Y:^Xfi),y 

(3) Let and Q'' he species such that = 0 and let y = (yj)jgN « family of non¬ 
negative parameters with 0 < Zgu{y) < oo and 0 < < oo. For each k 

sety’^ = {yk,y 2 k,y 3 k,- ■ ■)■ Let X = {F,a) be a Fsym{yu^),(z^^i{y),z^^ 2 {y^},-}-distributed 

random F-symmetry and let {Yi^k)i,keN an independent family (that is also independent 
of X) of random Q-symmetries such that Yi^k follows a Fgy^(^gi,^ yk-distribution for all 
k,i. We may canonically assign to each cycle r of the random permutation a a unique 
element Y^ of the list (YJ |^|)j>i. For example, we could do this by ordering for each i 
the cycles of a having length (. according to their smallest atom, and assign Yi^i to the 
ith cycle in the ordering. Then {X, {Yt-)^^) corresponds according to Lemma 3.3 to an 
Fo Q-symmetry over some set M. Draw a bijection a : M ^ [|-^|] uniformly at random. 
Then 

-C((J'o Q)[a]{x, {Yr)r)) = ^Sym{F‘^og''),y 

3.5. Combinatorial specifications and recursive Boltzmann samplers. 

3.5.1. Motivation. A recursive procedure is a list of instructions that are to be followed step by 
step and may contain references to the procedure itself. For example, a Galton-Watson tree may 
be described by the procedure that starts with a root vertex and attaches a random number of 
independent calls of itself. 

Often one encounter species admitting a recursive isomorphism such as F^ ~ A + (F^)‘^. If 
this decomposition satisfies a certain property (R), then for any admissible parameter we may 
apply the rules from Section 3.4.2 for the sum, product and composition of species in order to 
construct a recursive (P61ya-)Boltzmann sampler for F. That is, a recursive procedure that 
terminates almost surely and samples objects according to the (P61ya)-Boltzmann distribution. 

For the given example F, such a recursive Boltzmann sampler would first, by the sum rule, 
make a coin flip in order to determine whether it terminates with a single vertex (a Boltzmann 
sampler for or creates, by the product rule, an ordered pair of independent calls of itself. In 
other words, its a Galton-Watson tree. As property (R) guarantees that this process terminates 
almost surely, we also know that this Galton-Watson tree must be critical or subcritical. It is 
clear that not any recursive decomposition can have this desired property. For example, the 
species 1 which consists of a single object with size zero admits an isomorphism 1 ~ I • I, but 
applying the product rule yields a recursive procedure which never terminates. 

Precisely stating property (R) requires us to introduce the complex concepts of weighted 
multi-sort species and samplers, as well as related operations such as combinatorial composition 
and partial derivatives in this context. 

3.5.2. Combinatorial specifications. A 2-sort species TL \s a. functor that maps any pair U = 
{Ui,U 2 ) of finite sets to a finite set Ti\U] = TL\Ui,U 2 ] and any pair a = ((Ti,(T 2 ) of bijections 
cTi : Ui ^ Vi to a bijection LLla] : LLlU] —)• in such a way, that identity maps and composition 
of maps are preserved. A weighted 2-sort species additionally carries a weighting to given by 
family of maps 


^Ui,U2 ■ T^[Ui, U2] —)• M>o 
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for all pairs {U\,U 2 )- The weighting is required to be assign the same weight to isomorphic 
structures. That is, the diagram 


'H[UuU2f-^ 





'H[Vi,V2] 


must commute for all bijections ui : C/i —)■ Vi and <72 : C /2 —>• V 2 . 

The operations of sum, product and composition extend naturally to the multi-sort-context 
Let % and K, be 2-sort species and U = (f7i, 1 / 2 ) a pair of finite sets. The sum is dehned by 

in + jc)[u] = n[u]uic[u]. 

We write U = V + W li Ui = ViVJWi and 1^ H W = 0 for all i. The product is defined by 

{n-ic)[u]= y nivjxjciw]. 

v+w=u 

The partial derivatives are given by 

din[U]= H[UiU{*u,},U 2 ] and d 2 -H[U] = H[Ui,U 2 U {*U 2 }]- 

In order state Joyal’s implicit species theorem we also require the substitution operation for 
multi-sort species; this will allow us to define species “recursively” up to (canonical) isomorphism. 
Let Ti and T 2 be (1-sort) species and M a finite set. A structure of the composition 'H{Fi^F 2 ) 
over the set M is a quadruple (vr, y, a, /3) such that: 

(1) vr is partition of the set M. 

(2) X : vr —)• {1, 2} is a function assigning to each class a sort. 

(3) a a function that assigns to each class Q G vr a T'^(q) object a{Q) G Jx,(Q)[Q]- 

(4) P a L^-structure over the pair (x“^(l), x“^(2)). 

This construction is functorial: any pair of isomorphisms ai, a 2 with ai'. Ti ~ Qi induces an 
isomorphism [a 1 , 0 : 2 ] :'H{Fi,F 2 ) — 'H{Qi,Q 2 )- 

Let be a 2-sort species and recall that X denotes the species with a unique object of size 
one. A solution of the system y = T-L{X,y) is pair (A, a) of a species A with A[0] = 0 and 
an isomorphism a : A ~ T-L{X,A). An isomorphism of two solutions {A, a) and {B,(3) is an 
isomorphism of species u : A — B such that the following diagram commutes: 


A^^n{X,A) 

'H{id,u) 


B 


/3 


^n{x,B) 


We may now state Joyal’s implicit species theorem. Recall that 0 denotes the empty species with 
0[[/] = 0 for all hnite sets U. 

Theorem 3.5 ([44], Theoreme 6). Let Li he a 2-sort species satisfying = 0. If 

{d 2 'H){ 0 , 0 ) = 0, then the system y = Hi^X^y) has up to isomorphism only one solution. 
Moreover, between any two given solutions there is exactly one isomorphism. 


We say that an isomorphism T ~ 'H{X,F) is a combinatorial specification for the species F if 
the 2-sort species IL satishes the requirements of Theorem 3.5. 


3.5.3. Recursive Boltzmann samplers. Given a combinatorial specification F — 'H{X,F) we may 
apply the rules of Lemma 3.4 recursively to construct a recursive Boltzmann sampler that is 
guaranteed to terminated almost surely. A justihcation of this fact is given by Bodirsky, Fusy, 
Kang and Vigerske [17, Thm. 40] for species without weights, and the generalization to the 
weighted setting is straight-forward. Let us demonstrate this with an example. Let F is the 
species of binary unordered rooted trees, where each tree receives weight 1. Any such tree is 
either a single root vertex, or root vertex with two binary trees dangling from it. This yields an 
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isomorphism T — X ^ X ■ , where we set = T -T ^ and the two summands correspond to the 

two described cases. This may be reformulated by T" ~ • Q{T) = T") with ^ = 1 + X’^. It 

holds that (92^)(0, 0) = 0 • ^'(0) = 0, hence for each yi,y 2 , ... > 0 with 0 < Zjr{yi, y 2 , ■ ■ ■) < oo 
we may apply the rules of Lemma 3.4 to obtain a recursive procedure that terminates almost 
surely and whose output follows a -distribution. Briefly described, the procedure starts 

with a root-vertex and then draws a ^-symmetry according to a Boltzmann distribution. There 
are three different outcomes, either the symmetry has size 0, in which the sampler stops, or it has 
size 2, in which case the sampler calls itself recursively to determine the two subtrees dangling 
from the root-vertex, with the parameters depending on whether the ^-symmetry consists of two 
fix-points or a single 2-cycle. 


4. Unlabelled Gibbs partitions and subexponential sequences 

The term Gibbs partitions was coined by Pitman [60] in his comprehensive survey on combina¬ 
torial stochastic processes. It describes a model of random partitions of sets, where the collection 
of classes as well as each individual partition class are endowed with a weighted structure. 

Many structures such as classes of graphs may also be viewed up to symmetry. The symmetric 
group acts in a canonical way on the collection of composite structures over a fixed set, and its 
orbits may be identified with the unlabelled objects. Sampling such an isomorphism class with 
probability proportional to its weight is the natural unlabelled version of the Gibbs partition 
model. 

Let and be weighted species such that ^[0] = 0, and such that the ordinary generating 

series o G’^{z) is not a polynomial. An unlabelled Gibbs partitions is a random composite 
structure 

Sn = i'^ni Fn; (G(3)Qg7r„) 

sampled from the set of all unlabelled T o ^-objects with probability proportional to its weight. 

We are going to study the asymptotic behaviour of the remainder R„ obtained by deleting 
’’the” largest component from S^. More specifically, we make a uniform choice of a component 
Qo G TTn having maximal size, and let denote the T’Lobject obtained from the T'-object F„ by 
relabelling the Qo atom of F„ to a ^-placeholder. 

This yields an unlabelled X' o ^-object 

Rri := (TTn \ {Qo}, {^Q)q&-kX{Qq}) ^ ° G)- 

In the so called convergent case, the remainder is stochastically bounded and even converges 
in total variation toward a limit object. 


Theorem 4.1 ([66, Thm. 3.1]). Suppose that the ordinary generating series G'^{z) has positive 
radius of convergence p, and that the coefficients gi = [z^]Q''{z) satisfy 


9n 

9n+d 


~ p, 


1 

9n 


9i9j ~ ‘^G'-'ip) < 

i-\-j=n 


OC. 


Suppose further that 

ZMG^p) + e, G''\{P + e)'), • ■ ■) < oo 


for some e > 0. Then 

and 


dTv(Rn) R) —^ 0, n —>■ oo 

with R denoting a random unlabelled X' o Q-element that follows a 
bution. 




■Boltzmann distri- 
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5. Probabilistic and combinatorial tools 


5.1. The lattice case of the multivariate central local limit theorem. We will make 
frequent use of the classical local limit theorem for random walks. 


Lemma 5.1 (Central local limit theorem for lattice distributions). Let ^ be a random vector 
in whose support is contained in the lattice a + DZ'^ with D G GLrf(M), a G and in no 
proper sublattice. Suppose that Y has a finite non-zero covariance matrix "S, and let (Yj)j>i be 
a family of independent copies ofY. For all n and y set 

n 

Pniy) := = y) 

i=l 


and, as our assumptions imply that S is positive-definite, we may also set 

I det D| 


Pn{y) ■= 


(27rn)'^/2\/det^ 


exp 


(-^(y - nE[Y])TS-i(y - nE[Y])) . 


Then 


sup \Pn{y) - Pn{y)\ = o{n 

yGa+DZ*^ 

In particular, Pn{y) ~ Pn{y) uniformly for (y — nE[Y])/^/n bounded. 


5.2. A large deviation inequality for functions on finite Markov chains. The following 
Chernoff-type bound for finite Markov chains was established by Lezaud [53] and does not require 
the chain to be reversible. 


Lemma 5.2 ([53, Thm. 3.3]). Let {Xn)n>i denote an irreducible Markov chain on a finite state 
space S with transition matrix P and stationary distribution n. Suppose that the multiplicative 
symmetrization K = P^P is irreducible, and let e(K) denote its spectral gap. Let f : S ^ [—1, 1] 
denote a function whose expected value with respect to the distribution tt equals ^nif] = 0. Let 
b > 0 be a constant such that 0 < ||/||2 < b. Then for each initial distribution q = C.{Xi) and 
each 0 < (5 < 1 and n > 1 it holds that 

P{|/(.Y0 + ... + /(X„)| > S„) < 2N, exp 

where Nq = ||g/7r||2 and 

h{x) = ^ {yi + x-i + 1 ). 

5.3. A deviation inequality for random walk. The following deviation inequality is found 
in most textbooks on the subject. 

Lemma 5.3 (Medium deviation inequality for one-dimensional random walk). Let (Aj)jgf^ he 
an i.i.d. family of real-valued random variables with E[Ai] = 0 and E[e*^^] < oo for all t in 
some interval around zero. Then there are constants 5,0 0 such that for all n x >0 and 
Q < X < 5 it holds that 

E(| Ai Xn\ > x) <2 exp(cnA^ — Ax). 

The proof is by observing that E[e^l^il] < 1 -|- cX? for some constant c and sufficiently small A, 
and applying Markov’s inequality to the random variable exp(A(|Ai| |An|)). 

5.4. The cycle lemma. The following combinatorial result is given for example in Takacs [67]. 

Lemma 5.4 (The cycle lemma). For each sequence ki,... ,kn > —1 of integers with ki = 
—r < 0 there exist precisely r values of0<j<n — 1 such that the cyclic shift 

■ 1 knj) ■— (^l+ji) • • ■ ) k\, . . . ,kj) 

satisfies ^ 11=1 ^i,j > all I < u < n — 1. 
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6. Random unlabelled weighted enriched trees with applications 


We develop a framework for random enriched trees considered up to symmetry and present 
our main applications to different models of random unlabelled graphs. 

Index of notation. The following list summarizes frequently used terminology. 


7^^ 

AOJ 

P 

aTI 

'“n 

{c*Y 

K 

Kn 

JC° 

K 

dci-, •) 

dBLOCK(') •) 

Vk{-) 

Uk{-) 

f{Av) 

iT,l3) 

Tf 

{Tn,(3n) 

7"/ 

(rw,/3W) 

(^(oo)^/5(oo)) 

G 

G 

G 

H* = (H,u*) 


K-weighted species of 7^-structures, page 17 
w-weighted species of 77-enriched trees, page 17 

radius of convergence of the ordinary generating series Afi{z), page 18 
random n-sized unlabelled 77-enriched tree, page 17 

random n-sized unlabelled rooted block-weighted connected graph, page 18 

block-weighted species of rooted connected graphs, page 18 

weighted species of 2-connected graphs, page 18 

species of /c-dimensional front-rooted trees, page 18 

random unlabelled front-rooted A;-tree, page 18 

subspecies of K, of /c-trees where the root front lies in precisely one hedron, 
page 18 

random unlabelled k-lxee from the class /C° , page 18 
graph-metric on a connected graph G, page 4 
block-metric, page 4 
graph-distance /c-neighbourhood, page 4 
block-distance fc-neighbourhood, page 4 

enriched fringe subtree of an enriched tree A at a vertex u, page 19 
random Sym(77)-enriched plane tree, page 19 
fixpoint subtree corresponding to (T,/3), page 20 

the random Sym(77)-enriched plane tree (7^,/3n) conditioned on having n 
vertices, page 20 

fixpoint subtree corresponding to {Tn,l3n), page 20 

size-biased Sym(77)-enriched tree, page 22 

local weak limit of the Sym(77)-enriched tree {Tn,l3n), page 23 

plane tree trimmed at height k, page 23 

^-enriched tree trimmed at height k, page 23 

random ^-object, page 20 

random ^-object with a bias on the number of fixpoints, page 24 
random ^-object with a bias on the number of non-fixpoints, page 24 
the local limit of {Tn,/3n) near a random vertex, page 25 


6.1. Random weighted 77-enriched trees. The concept of 77-enriched trees was introduced 
by Labelle [47], and facilitates the unihed treatment of a large variety of tree-like combinatorial 
structures. 

Given a species of structures TZ, the corresponding species of TZ-enriched trees A-ji is constructed 
as follows. For each finite set U let Ati[U] be the set of all pairs {A, a) with A G A[U] a rooted 
unordered tree with labels in U, and a a function that assigns to each vertex v oi A with offspring 
set My an 77-structure a{v) G IZ[My]. The transport along a bijection a : U ^ V relabels the 
vertices of the tree and the 77-structures on the offspring sets accordingly. That is, »47e[(T] maps 
the enriched tree {A, a) to the tree {B,(3) with B = A[a]{A) and /3 ((t(u)) = TZ[a\M„]{a{v)) for 
each V ^ A. The species of 77-enriched trees admits the combinatorial specification 

( 6 . 1 ) An — X- TZiAn), 


RANDOM ENRICHED TREES WITH APPLICATIONS TO RANDOM GRAPHS 


17 



Figure 2. Correspondence of rooted connected graphs and enriched trees. 




as any 7?,-enriched tree consists of a root vertex (corresponding to the factor X) together with 
an 7^-structure, in which each atom is identified with the root of a further 7^-enriched tree. By 
Theorem 3.5 it holds that given any species J- with an isomorphism T" ~ T • TZ{X), there is a 
natural choice of an isomorphism X ~ A-ji- Hence a large variety of combinatorial structures 
have a natural interpretation as enriched trees. 

Given a weighting k on the species TZ, we obtain a weighting w on the species An given by 

(6.2) oj{A,a) = K{a{v)). 

v&A 

This weighting is consistent with the isomorphism in (6.1), that is, 

(6.3) A^c^X-n^iA^^). 

We are going to study the random unlabelled enriched tree A^, drawn with probability 
proportional to its weight among all unlabelled objects with size n. In the following, we illustrate 
how this model of random enriched trees generalizes various models of random graphs. The list 
is of course non-exhaustive as a huge variety of other special cases of may be found in the 
literature. 

6.1.1. Simply generated Polya trees. For TZ = SET, the species An describes rooted unordered 
trees. The corresponding unlabelled objects are also called Polya trees. Given a weight sequence 
w = (wfc)fc, we may assign weight to each A:-sized 7^-structure. Then A^ is the random 
unordered unlabelled tree such that any Polya tree A with n vertices gets drawn with probability 
proportional to W.u^a^(A(v)' ''^ith d\{v) denoting the outdegree of a vertex v. Note that setting 
weights to zero allows us to impose arbitrary degree restrictions. It is known that, depending on 
the weight-sequence, simply generated plane trees may show a very different behaviour, and it is 
natural to ask the same questions for simply generated Polya trees. 

6.1.2. Random unlahelled connected rooted graphs with weights on the blocks. The species C 
connected graphs admits a decomposition in terms of the species B of graphs that are 2-connected 
consist of two distinct vertices joint by an edge. The well-known combinatorial specification 

(6.4) C* ~ T • SET(H'(C*)) 

is illustrated in Figure 2. This allows us to identify the species C* of rooted connected graphs 
with SET o H'-enriched trees. That is, rooted trees, in which each offspring set gets partitioned, 
and each partition class Q carries a H'-structure, that has |Q| -|-1 vertices, as the =t=-vertex receives 
no label. The isomorphism (6.4) can be found for example in Harary and Palmer [41, 1.3.3, 
8.7.1], Robinson [61, Thm. 4], and Labelle [48, 2.10]. 
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Let 7 be a weighting on the species B. We may consider the weighting cj on C that assigns 
weight 

u:{C) = YI^{B) 

B 

to any graph C, with the index B ranging over the blocks of C. The random graph drawn 
from the unlabelled n-sized C*-objects with probability proportional to its a;-weight is distributed 
like the random unlabelled enriched tree for the weighted species TZ'^ = (SET o B'Y, with k 
assigning the product of the 7 -weights of the individual classes to any assembly of ^'-structures. 

If all 7 -weights are equal either to 0 or 1, we obtain random connected graphs from so called 
block-classes (or block-stable classes), that is, classes of graphs defined by placing constraints on 
the allowed blocks. For example, any class of graphs Ex(Wl) that may be defined by excluding a 
set Ai of 2-connected minors is also block-stable. Here a minor of a graph G refers to any graph 
that may be obtained from G by repeated deletion and contraction of edges. Prominent examples 
are outerplanar graphs Ex(A' 4 , iL 2 , 3 ), that may be drawn in the plane such that each vertex lies 
on the frontier of the infinite face, and series-parallel graphs Ex(iL 4 ), that may be constructed 
similar to electric networks in terms of repeated serial and parallel composition. These two 
classes fall under the more general setting of random graphs from subcritical block-classes in the 
sense of Drmota, Fusy, Kang, Kraus and Rue [27] , which also are special cases of the random 
graph 


6.1.3. Random unlabelled front-rooted k-trees. Let/C denote the 
species of A;-dimensional trees that are rooted at a front of distin¬ 
guishable ^-placeholder vertices. We let K„ denote the random 
unlabelled front-rooted k-tree that gets drawn uniformly at ran¬ 
dom among all unlabelled /C-objects with n hedra. 

We consider the subspecies K,° front-rooted k-trees where the 
root-front is contained in precisely one hedron, and let K° be 
sampled uniformly at random from the unlabelled /C-objects with 
n hedra. 

Any element from /C may be obtained in a unique way by 
glueing an arbitrary unordered collection of /C°-objects together 
at their root-fronts. Hence 



/C~ SET(/C°). 


Figure 3. Decomposition of 
the class /C° for k = 2. 


As illustrated in Figure 3, any /Ci-object may be constructed 
in a unique way by starting with a hedron H consisting of the root-front and a vertex v, and 
then choosing, for each front M oi H that contains v, a k-tree from JC whose root-front gets 
identified in a canonical way with M. Hence 


/C°c..A’.SEQ|,|(/C). 


Combining the isomorphisms yields 


/C°~ A-(SEQ|fc|oSET)(/C°). 

This identifies the species /C° as SEQj^j o SET-enriched trees, and the species K. as unordered 
forest of enriched trees. In particular, K° corresponds to the random unlabelled enriched tree Pkf. 


6.2. Local convergence of random unlabelled enriched trees. In the following, we let IZ^ 

denote an arbitrary weighted species such that the inventory |77.[/c]|k is positive for k = 0 and for 
at least one k>2. We let denote the corresponding species of weighted IZ-enriched trees. 
The radius of convergence of the ordinary generating series Af^{z) will be denoted by p. 
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Figure 4. The encoding of ^ 7 ^-syninietries as Sym(7^)-enriched trees. 


6 . 2 . 1 . A coupling with a random Q-enriched (plane) tree. Our first observation is that any 
symmetry S = {{T,a),a) of an 7^-enriched tree A = {T,a) admits a tree-like decomposition 
in form of a Sym(7^)-enriched tree {T,f3). Indeed, the automorphism a fixes the root o of T 
and permutes the roots of the 7^-enriched trees dangling from o in such a way, that the induced 
permutation cj(o) on the offspring of o is an automorphism of the 7^-structure a(o). This yields 
an 7^-symmetry /3(o) := (a(o), it(o)). For each fixpoint v of the permutation 17(0) it holds that 
the restriction of a to the 7^-enriched fringe subtree f{A,v) (the maximum enriched subtree 
rooted at the vertex v) yields an ^ 7 ^-enriched symmetry (/(^, u), it|„)) and we may proceed 
with the construction of /3 in the same way. For each cycle r = {vi,... ,vt) of a{o) having length 
t >2 the situation is more complicated. We know that a permutes the 7^-enriched fringe subtrees 
f{A,Vi) cyclically. Hence they are all structurally equivalent, and in fact, by the discussion 
in Section 3.3, up to isomorphism composed out of isomorphic symmetries (/(H, Uj), cjj) with 
f{A,vi)- Hence we may proceed with the construction of fd as before, by considering the 
individual symmetries. This process is illustrated in Figure 4. Note that the Sym(7^)-enriched 
tree (T, /3) does not contain all information about the symmetry S, but we may reconstruct S 
up to relabelling from (T, f3). 

The fixpoints of the automorphism a form a subtree of T. Each fixpoint v has a possibly 
empty set f{v) of other fixpoints as offspring, and the remaining offspring correspond to a forest 
F{v) of Sym(7^)-enriched fringe subtrees f{{T,(d),Vi), which consist of non-fixpoints of a. We 
are going to say the triple G{v) := {f3{v), f{v), F{v)) is a ^-object on the fixpoints f{v) and 
define |/(u)| to be its size. Formally, ^-objects do not correspond to any species, but the analogy 
is clear, and we may call (T-^, ) a ^-enriched tree. 

Similarly, we may define the concept of a Q-enriched plane tree, in which the label set of 
each occurring 7^-symmetry is required to belong to the collection {[/c] | A: > 0}. We are going 
to use the following recursive procedure illustrated in Figure 5 in order to sample random 
.AT^-symmetries according to a weighted Boltzmann-distribution. 


Lemma 6.1 (A coupling of random unlabelled 7^-enriched trees with random ^-enriched trees). 
For any parameter a: > 0 with A^{x) < 00 consider the following recursive procedure r5(x) 
which draws a random SymiJZ)-enriched plane tree {T,(3). 

1. Start with a root vertex o and draw a random symmetry 


l3{o) = {R{o),a{o)) 


from the set IJfe>o Sym{TZ)[k] such that f3{o) gets drawn with probability proportional to 


\Rio)\\ 


Here cJi(o) denotes the number of i-cycles of the permutation cr(o), with fixpoints counting as 
1-cycles. 

2. For each cycle r of the permutation cr(o) draw an independent copy (T’^,/3'^) of the recursively 
called sampler TS{x^Af Here |r| > 1 denotes the length of the cycle. For each atom a of t 
make an identical copy (T“,/3“) 0 //3'^). 
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3. Let k denote the size of the Sym{Tl)-structure /3{o). For each label a G [k] add an edge between 
the root vertex o and the root of the plane tree 7~“. The ordering of the offspring set is given 
by the order on the label set [A:]. This defines a plane tree T with root-vertex o. Moreover, for 
each a G [k] and each vertex v G T“ set I3(v) := I3°‘{v). This defines a SymiJZ)-enriched plane 
tree {T,f). 

This procedure terminates almost surely and the resulting SymiJZ)-enriched plane tree {T,I3) 
corresponds to a symmetry on the vertex set of the plane tree T. Let (x) denote the result 

of relabelling this symmetry uniformly at random with labels from the set [n], with n denoting the 
number of vertices of the tree T. Then for any symmetry {A, a) from the set IJfc>o 
it holds that 

r{rZj,^Jx) = {A,a))=u;{A)—Afi{x)-y 


If we condition the sampler TZj^^ (x) on producing a symme¬ 
try with size n, then any symmetry from Sym(^ 7 ^) [n] gets drawn 
with probability proportional to the cj-weight of its ^ 7 ^-object. 

By the discussion in Section 3.1 it follows that the isomorphism 
class of this .4.7^-object is distributed like the random unlabelled 
7^-enriched tree A^. 

Suppose that the radius of convergence p of the ordinary 
generating series Afi{z) is positive. As we state below, it holds 
that Af^{p) is finite and hence we may consider the random 
Sym(7^)-enriched (T, J) be drawn according to the sampler 
TS{p). The vertices of T that correspond to fixpoints of the 
symmetry rZ_ 4 <^(p) form a subtree C T containing the root. 

Note that by the discussion in Section 3.3 the hxpoints correspond precisely to the vertices in 
which the sampler TiS calls itself with parameter p (as opposed to parameter p* for some i > 2). 

For each vertex v of let G-j-f{v) = {j3{v), f{v), F{v)) denote the corresponding ^-object. 
Moreover, let (Tn,/3n), TJ and G^fi ) = (Jn{ ),fn{ ),Fn{-)) denote the corresponding random 

' n 

variables conditioned on the event \T\ = n. Let G be a random variable that is identically 
distributed to the ^-object Gq-f{o) corresponding to the root o of ■ Moreover, let ^ denote 
the number of the fixpoints of G and C, the size of the enriched forest corresponding to the 
non-fixpoints. 



Lemma 6.2 (Properties of the coupling with ^-enriched trees). We make the following observa¬ 
tions. 

(1) The radius of convergence p of Afi{z) and the sum Afi{p) are both finite. 

(2) The size of the tree T satisfies 


\T\= ^(l + |F(n)|). 
v&Tf 

(3) For any SymjZ)-enriched plane tree {T',f3') corresponding to Q-objects Gi,...,G^ it 
holds that 


i 

F{{T,j) = {r,j')) = Yin(^ = Gj. 

i=l 


(4) An arbitrary sequence of Q-objects Gi = {Si, f, FJ, i = 1,... corresponds to a Sym{7Z)- 
enriched tree if and only if 


i m 

\fi\ = £ — 1 and \fi\ > m for all 1 < m < £ — 1. 
i=l i=l 

Let Gj = (Sj,fi,Fj) denote independent identical copies ofG. Let L denote depth-first- 
search ordered list L of the Q-objects of tJ and \L\ its length. Then (L | \L\ = £) is 
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distributed like 


((Gi, I ^(1 + |Fj|) = n, ^ |fj| = £ - 1,^ |fi| >m for all I <m < i - 1). 

i=l 2=1 2=1 

(5) The plane tree is distributed like a Galton-Watson tree with offspring distribution f 
having probability generating function 

E[2;^] = Z'jin[zAfi{p), Afiip^)^.. .)p/Afi{p). 

(6) Given , the forests {F{v))y^-j-f are eonditionally independent. The eonditional distri¬ 
bution of each forest depends only on the outdegree di^f{v). The distribution of the forest 
size C is given by its probability generating function 

E[2;^] = ZTiK(Afi{p),Afi{{pz)‘^),Afi{{pz)^),.. .)p/Afi{p). 

In a more specific setting, where the random vector (^, Cf) has finite exponential moments, 
even more can be said. 

Lemma 6.3 (Further properties of the coupling with 7^-enriched trees in a specihc setting). 
Suppose that p > 0 and that the function 

E{z,u) = zZTiffu,Afi{z‘^),Afi{z^),...) 
satisfies E{p + e,A^(p) + e) < oo for some e > 0. 

(1) Then the nth coefficient of Afi{z) is asymptotically given by 


as n = 1 mod span(w) tends to infinity. 

(2) The series Afi{z) has square root singularities at the points 

Sk = pexp(27rzA:/span(w)), A: = 0, ..., span(w) — 1, 

with local expansions as analytic functions of \J\ — zjSk- 

(3) The offspring distribution f of the Galton-Watson tree and the random variable Q 
have finite exponential moments. Moreover, 

E[e] = Euip,Af,ip)) = p{TZ^Af,{p) = 1, 

V[e] = Euuip,AUp))AUp), o.nd E[C] = Effp,AUp))p/^n{p) - 1- 

Gonsequently, 

P(|r| =n) 




(4) Suppose that at least one TZ-structure with positive n-weight has a non-trivial automor¬ 
phism. Then the lattice spanned by the support of (^,C) has a 2-dimensional X-basis 
B G 1?^“^, and the covariance matrix El of (fX) is positive-definite. Set 

1 2 det El |detB| 

p = z— —zzrzr, <T = ^TTFZTTz — —zzrzrzz, and d = 


1 + E[C]’ V[C](1+E[C])3’ 

Then, as n = 1 mod span(w) tends to infinity, 

d 


span^w ) 




x 


, exp(—- „ 

20 - 2 ' 


uniformly for all bounded x satisfying 

i := pn-\- xffn G n + dZ. 

In particular. 


\TJ\- np 


\r m 
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Properties (1) and (2) are an application of results by Bell, Burris and Yeats [12]. The 

requirement in (4) that TZ has at least one structure (with positive k- weight) with non-trivial 

f 

symmetries is not really a restriction. If it fails, then almost surely |7n | = n for all re, which 
makes the analysis of even easier. 


6.2.2. Local convergence around the fixed root. Labelle 
established in [47, Thm. A] the following decomposition 
of pointed 7^-enriched trees, which will aid us studying 
the behaviour of the 7^-structures along paths starting 
from the root in random enriched trees. The weighted 
species satishes an isomorphism Af^ ~ X ■ TZ'^{Afi). 
The derivative operator satisfies a product rule similar 
to the product rule for the derivative of smooth func¬ 
tions, see Proposition 3.2. Hence applying the pointing 
operator yields a weight-compatible isomorphism 



Figure 7. The sampler 


A\-An + X- n'{An) • A\. 

We may apply Joyal’s implicit species theorem [44, Th. 6] in order to unwind this recursion and 
obtain an isomorphism 


OO 

~ ^ ^W, Af = {X- {An) fAn 

corresponding to the pointed enriched trees ((A, oi),v) in which the outer root v has height t in 
the rooted tree A. The correspondence is illustrated in Figure 6. Again, this isomorphism is 
compatible with the weightings, and we may use it to construct the following sampler illustrated 
in Figure 7. 

Lemma 6.4 (A modihed random Sym(7^)-enriched tree). For any integer i >0 and parameter 
X > 0 with (.4.^^)‘^(x) < OO consider the following recursive procedure rS^^\x) that samples a 
random Sym{TZ)-enriched plane tree {T^^\l3^^'^) together with a distinguished vertex r which we 
call the outer root. 

1. If i = 0 then return (an independent copy of) the random enriched plane tree {T,/d) from 
Lemma 6.1 with the outer root being the root-vertex ofT. Otherwise, if i > 1, then proceed 
with the following steps. 
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2. Start with a root vertex o and draw a random TZ'-symmetry {R,cr) from Ufc>o Sym(Tl')[k] with 
probability proportional to 

Set k := \R\ and make a uniformly at random choice of a bijection f from the set [/c] U {*fc} 
of labels of the TZ-structure R to the set of integers [/c + 1]. Relabel the symmetry via the 
transport function: 

f3{o) := {R{o),a{o)) := Sym{TZ)[f]{R,a). 

Let b := f{*k) denote the vertex corresponding to *k- 

3. Note that b is a fixpoint of the permutation cr{o). For each cycle r ^ ( 6 ) of a(o) draw an 

independent copy of the sampler with |r| > 1 denoting the length of the 

eycle. For each atom a of the cycle r make an identical copy of 

4- Draw an independent eopy of the sampler 

5. For eaeh label a € [fc + 1] add an edge between the root vertex o and the root of the plane tree 
T°‘. The ordering of the offspring set is given by the order on the label set [/c + 1]. This defines 
a plane tree T with root-vertex v. Moreover, for each o G [A: + 1 ] and each vertex u G T“ set 
f3{u) := /3“(tt). This defines an Sym{TZ)-enriched plane tree . 

This procedure terminates almost surely. As described in Section 6.2.1, the resulting SymiJZ)- 
enriched plane tree corresponds to a symmetry on the vertex set of the tree . Let n 

denote the number of vertices of the tree and let TZ Ji){x) denote the result of relabelling this 

symmetry uniformly at random with labels from the set [n]. Then TZ (e){x) satisfies a weighted 

Boltzmann distribution, that is, for any symmetry {{A,r),a) from the set Sym{A!:^)[k] we 
have that 

(6.5) ¥{TZ^(^{x) = {{A,r),a)) = u:{A)—A^^ (x)-^ 

In partieular, 

P(rZ_^w(x) = {iA,r),a)) = (x))^P(rZ^^(x) = {A, a)). 

Suppose that p > 0 and consider the Sym(7^)-enriched tree generated by the 

procedure TS^^^p). The path from the root to the distinguished vertex in is its spine. If 
we set £ = oo, the above construction yields an infinite but locally finite Sym(7^)-enriched tree 
(T(“),/3(“)) having an infinite spine. We will show that this object, is the local limit weak limit 
of the random graph Af, if certain conditions are met. 

In order to formalize our notion of local convergence, we require the concept of trimmed 
^-enriched trees. For any ^-enriched plane tree (r, 7 ) and any non-negative integer k let (r, 7 )^*^^ 
denote the result of trimming at height k. That is, 

(r, 7 )<*^> = (r[^l,(7(u))^g.^[fc-i]), 

with denoting the plane tree trimmed hat height k. That is, we delete all vertices from r 
with height larger than k. In order to simplify notation, we also set {F', ■= (t, 7 )^^^ for 

the Sym(7^)-enriched plane tree {T',/3') corresponding to (r, 7 ). 

Theorem 6.5 (Local convergence of random unlabelled 7^-enriched trees). Suppose that the 
ordinary generating series generating series Afi{z) has radius of convergence p > 0, and that the 
series 

E{z,u) = zZ'jiK(u,Afi{z‘^),Afi{z ^),...) 

satisfies E{p -|- €,A:^{p) + e) < 00 for some e > 0. Then for any sequence kn = o{^/n) of 
non-negative integers it holds that 

dTY{{Tn,/3n)^’^->, ( 7 -(oo)^ ^(oo)^ q 


as n beeomes large. 
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The limit object admits a more accessible description in terms of ^-enriched 

trees, that we are going to use in the proof of Theorem 6.5. Let G denote a random variable 
that is distributed like the G-object corresponding to the root of . Here we do not explicitly 
distinguish the vertex corresponding the vertex of the T^'-gyxnmetry. 


Lemma 6 . 6 . Suppose that p> 0, E[^] = 1 and that (^, C) has a finite covariance matrix. 

(1) The Q-object corresponding to the root ofT^°°'^ together with the fixpoint corresponding to 
the second spine-vertex is distributed like G with a uniformly at random selected marked 
fixpoint. 

(2) The distribution of the limit enriched tree may be described as follows. There 

are normal fixpoints and mutant fixpoints, and we start with a mutant root. Each normal 
fixpoint receives as Q-object an independent copy ofQ, and each of the fixpoints of this 
Q-object is declared normal. Ever mutant fixpoint receives an independent copy of G, and 
one of the corresponding fixpoints is selected uniformly at random and declared mutant, 
whereas the remaining fixpoints are declared normal. 

(3) Let (r, 7 ) denote a Q-enriched tree with height at least k. Let Gi,...,Gt denote the 

depth-first-search ordered list of the Q-objects of Then 

t 

P((.^(oo)^^(oo))<fc> ^ (^,y)<fc>) = Lfc(r)nP(G = Gi). 

i=l 

(4) For any Q-enriched tree (r, 7 ) with 7 ( 1 ;) = {/3r{v), fr{v), Ft-{v)) and any integer k > 0 we 
set 


Lkir) = \{v G T \ hriv) = k}\, lI{t) = ^ \Fr{v)\, (t) ='^ Lf (t). 

vGt i=0 

hr {v) = k 

(^, C) denote the sizes of the fixpoints and non-fixpoints of G. When {t,^) is the 
Q-enriched tree corresponding to (3^°°'^), it holds for all k > 1 that 

E[Lfc(r)] = fc(E[|]-l) + l 

E[|r[^l|] = A;(fc + 1)(E[|] -l)/2 + A; + l 

E[Lf(r)] = A:(E[e1-lMC]+E[a 
E[hS{t) - Irl'^llEK]] = (fc + l)(E[Cl -E[C]) 

Y[hI{t) - Irl'^llEK]] = k{k + 1)(E[|] - l)V[C]/2 + {k + 1)(V[C] + V[C]) 


6.2.3. Local convergence around a random root. Inspired by Aldous’ approach [3] on fringe 
subtrees of random trees, we may also treat local convergence with respect to a uniformly at 
random drawn root of the random unlabelled enriched tree fkf in a similar manner. Let v* be an 
uniformly at random drawn vertex of the tree 7^, and let vq denote the unique nearest vertex 
of tJ . That is, vq = v* if v* G TJ , and otherwise vq is the unique vertex of the fixpoint tree 
whose G-object contains v*. For any i > 1, let Vi denote the z’th predecessor of vq in the fixpoint 
tree Tn , if this predecessor exists. If not, that is, if vq has height greater than i in Tn , then set 
Vi = o and f{{Tn, l3n),Vi) = ^ for some symbol o not contained in the set of ^-enriched trees. 
(For example, we could use the empty set.) For any k > 0 we consider the vector of increasing 
fringe subtrees 

Hffc] := mO<^<k := {fUTn, fin), Vo),..., f{{Tn, fin),Vk)). 

We are going to establish convergence of these random vectors of enriched trees toward 
extended enriched fringe subtrees of a limit object, which we introduce in the following lemma. 


Lemma 6.7. Suppose that p> 0, E[^] = 1 and that (^, C) has a finite covariance matrix. 
(1) Let G = (S,f, F) denote the random Q-object with distribution given by 
E(G = (5, /, F)) = (1 + |F|)P(G = (5, /, F))/(l + E[C]). 
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We define an infinite random Q-enriched tree H in terms of its sequence (Hfc)fc>o of 
increasing extended enriched fringe subtrees. The distribution of the fringe subtree tree 
is given as follows. There are normal fixpoints and special fixpoints, and we start with a 
special root. Each normal fixpoint receives as Q-object an independent copy ofG, and all 
fixpoints of this Q-object are declared normal. Every special fixpoint with height less than 
I receives an independent copy ofG, and one of the corresponding fixpoints is selected 
uniformly at random and declared special, whereas the remaining fixpoints are declared 
normal. A special fixpoint with height I receives G and all fixpoints of this Q-object are 
declared normal. 

Then the special vertices of H form an infinite spine uo,ui,... that grows backwards, 
with 

for all k. We distinguish a point u* that is drawn uniformly at random from the set 
{uo} U F(mo)? with F(iio) denoting the set of non-fixpoints of Q-object corresponding to 
the root of Hq, and set H* = 

(2) Eor any two Q-enriched trees A and A!, let Q{A,A') denote the number of fixpoint sons 
V of the root of A with f{A,v) = AG Eor any increasing finge subtree representation 
H = (-fri)o<i<fc of a Q-enriched tree we set 

k 

i=l 

Then p(H) counts the number of fixpoints v at height k in with the property, that the 
extended enriched fringe subtree respresentation with respect to v is identical to H. 

(3) Let u be either the root of Hq or a non-fixpoint of Q-object corresponding to the root of 
Hq, and let Gi,... ,Gt be the Q-objects corresponding to H^. Then 

t 

= H) = p(H)(l + E[C])-' n 

i=l 

(4) For any Q-enriched tree (r, 7 ) with j(v} = fr{v), Ft-{v)) let = |r| denote 

its number of fixpoints, and ff{T,'^) = + Fr{v)) its total size. Then for any 

sequence kn = of non-negative integers with tn = o(l) it holds with probability 

tending to one that 

#/Hfc„ < ntn and |#Hfc„ - E[C]#/Hfc„| < \/n^. 


We may now establish convergence of the extended enriched fringe subtrees, that will help 
us to apply our main theorems to specific examples of random discrete structures, in particular 
random graphs. 

Theorem 6.8 (Local convergence of random unlabelled 7^-enriched trees around a random root). 
Suppose that the ordinary generating series generating series Afi{z) has radius of convergence 
p > 0, and that the series 

E{z,u) = zZn^{u,AUz^),AUz^),...) 

satisfies E{p + e,A‘^{p) + e) < oo for some e > 0. Then for any sequence kn = o{^/n) of 
non-negative integers the increasing fringe subtree sequence Flj^ j of length kn, corresponding to 
the uniformly at random drawn vertex v* of {Tn, fdn), converges in total variation to the fringe 
subtree sequence of H with the same length. That is, 

(6.6) dTv((Hf,„],u*),(H[fcj,u*))^0. 

Here u* denotes a random vertex of FI that we defined in Lemma 6.7. 
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6.3. Scaling limits of metric spaces based on 7^-enriched 
trees. 


(C/a, 5(a)) 



Figure 8. Patching together 
discrete metric spaces. 


6.3.1. Patching together discrete metric spaces. We study metric 
spaces patched together from metrics associated to the vertices 
of a tree. Let ^ be a rooted tree. For each vertex u G A let My 
denote its offspring set. Let 5 be a map that assigns to each vertex 
n of ^ a metric S{v) on the set Uy := My U {u}. This induces a 
metric d on the vertex vertex set V (A) that extends the metrics 
5{v) by patching together as illustrated in Figure 8 . Formally, we 
define this metric as follows. Consider the graph G on the vertex 
set of A obtained by connecting any two vertices x 7 / y if and 
only if there is some vertex v of the tree A with x,y G Uy and 
assigning the weight S{v){x,y) to the edge. The resulting graph is 
connected and the distance of any two vertices a and b is defined 
by the minimum of all sums of edge-weights along paths joining 
a and b in the graph G. 

Suppose that for each finite set U and each 7^-structure R G TZ[U] we are given a random 
metric dpt on the set U U {*[/} with *u denoting an arbitrary fixed element not contained in U. 
For example, we could set *u := {U}. Let = (A„, an) denote the random re-sized 7^-enriched 
tree drawn with probability proportional to its w-weight. We construct a random re-element 
metric space Y„ as follows. For each vertex v of A^ with offspring set My let 5„(u) be the metric 
on the set My U {u} obtained by taking an independent copy of and identifying with 

V. Let dy^ denote the metric patched together from the family (5n('i^))'u as described in the 
preceding paragraph. 

In order for this to be a sensible model of a random tree-like structure we require the following 
two assumptions. 


(1) We assume that there is a real-valued random variable x > 0 such that for any 7^-structure 
R the diameter of the metric 6r is stochastically bounded by the sum of |i?| independent 
copies xf,... ,X|^| of X- 

(2) For any bijection a : U ^ V of finite sets and for any 7^-structure R G R-lU] we require 
that the metric 57 ^[o-](_r) is identically distributed to the push-forward of the metric Sr by 
the bijection a : U U {*f/} V U {*v} with a\u = a. 

Recall that by Lemma 6.3 the radius of convergence p of A^{z) is finite, and the sum A^{p) 
is finite as well. 


Theorem 6.9 (Scaling limits in the unlabelled setting). Suppose that the ordinary generating 
series generating series A^i{z) has radius of convergence p > 0 and that the series 

E{z,u) = zZti 4 u,Ar{z‘^),Ar{z^), ...) 

satisfies 

E{p + e, Api{p) + e) < 00 

for some e > 0. Then the rescaled space (Y^, re“^/^ciY„) converges weakly to a constant multiple 
of the (Brownian) continuum random tree Te with respect to the Gromov-Hausdorff metric as 
re = 1 mod span(w) tends to infinity. 

An explicit expression of the scaling constant in Theorem 6.9 is given in the corresponding 
proof in Section 7.2. It is interesting to note that the local weak limit contains 

some information on the scaling limit, as it is responsible for one of the factors in the scaling 
factor. We also provide the following sharp tail-bound for the diameter. 

Theorem 6.10 (tail bounds for the diameter in the unlabelled setting). Under the same 
assumptions of Theorem 6.9 there are constants G,c> 0 such that for all re and x >0 it holds 
that 

P(D(Ya) > x) < G(exp(—cx^/n) + exp(—cx)). 
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Again it holds that if x is bounded, then we have the tail-bound 

P(D(Y„) > x) < C'exp(—cx^/n) 

for some constants C, c > 0 . 

The main idea of the proofs of Theorems 6.9 and Theorem 6.10 is that we may use the random 
Sym(7^)-enriched tree of Lemma 6.4 to relate for any two vertices x, y G tJ, distances 

dy (x,y) and d^f{x,y) by constant factor. The following basic observation then takes care of 

' n 

the rest. 


Lemma 6.11. Suppose that p > 0 and that the function 

E{z,u) = zZt^.{u,AUz^),AUz^), ...) 

satisfies E{p + e,Af^[p) -|- e) < oo for some e > 0. Then the following assertions hold: 

(1) There are constants C,c > 0 such that for all n and x > 0 

P(max(|/n(u)| -|- |F,i(u)|) > x) < exp(—cx). 

veTj 

(2) For any vertex v let denote the dy ^-diameter of the subspace {u} U En(v) C Y„. Then 
there are constants C,c> 0 such that for all h >0 

P(maxZl.i; > h) < exp(—c/i). 


u&TJ 


(3) We have that 


V(l+E[C])V[e] . d 

> n 


2^/n 


■% 


in the Gromov-Hausdorff sense as n = l mod span(w) tends to infinity. 


A result similar to Lemma 6.11 was used in [59] to provide a combinatorial proof for the 
scaling limit of uniform random Polya trees (with possible degree restrictions). Theorem 6.9 is 
more general, as it applies for example to scaling limits of models of random graphs with respect 
to the first-passage-percolation metric, and its proof is deeper and more involved, as it requires 
the interplay with the Sym(7^)-enriched tree {fT^^\ of Lemma 6.4 that is related to the local 
weak limit. 


6.4. Applications to random nnlabelled rooted connected graphs. Let C denote the 
class of connected graphs and B its subclass of graphs that are two-connected or a single edge 
with its ends. Recall that the rooted class C* may be identified with the class of SETo.B'-enriched 
trees as discussed in Section 6 . 1 . 2 . Suppose that we have a weighting 7 on the class B, that is, 
for each .B-graph B we are given a weight 7 (B) > 0 such that the weights of isomorphic graphs 
agree. This induces a weighting k on the species SET o B' by setting the weight of a set of graphs 
to the product of the individual weights. Hence we also have a weighting a; on C given by 

w(c)= 

B 

for all C-objects C, with the index B ranging over all blocks of the graph C. In the following, we 
study random unlabelled rooted graph Cif drawn from the unlabelled C*-objects of size n with 
probability proportional to its a;-weight. This corresponds to the model of random unlabelled 
enriched trees in Section 6.1 for the special case IZ'^ = (SET o B')'^. 

Under the premise that the cycle index sums related to the random graph Qf satisfy Equa¬ 
tion (6.7), we establish a local weak limit for the vicinity of the fixed root in Theorem 6.12 
and a BenjaminUSchramm limit in Theorem 6.13. In both cases we actually establish total 
variational convergence of arbitrary o(-y/n)-neighbourhoods, which is best-possible in this setting. 
We also consider the first-passage-percolation metric on the graph C‘^, which is more general 
than the graph-metric, and establish sharp exponential tail-bounds for the diameter and a 
Gromov-Hausdorff scaling limit in Theorem 6.14. As a byproduct, we also obtain a bound for 
the size of the largest 2 -connected component. 
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As an important special case, these Theorems apply to uniform random unlabelled rooted 
graph from a subcritical block class. This model was studied by Drmota, Fusy, Kang, Kraus and 
Rue in [27, Def. 10], and includes uniform random rooted unlabelled cacti graphs, outerplanar 
graphs, and series-parallel graphs [27, Thm. 15]. The scaling limit Theorem 6.14 is a strong 
result in this context and also establishes the correct order of the diameter of this type random 
graphs. 

6.4.1. Local weak limit. The infinite Sym(SETo;B')-enriched tree from Section 6.2.2 

is naturally also a SET o .B'-enriched tree, and may hence be interpreted as an inhnite locally 
finite random graph C according to the bijection in Section 6.1.2. Theorem 6.5 yields local weak 
convergence of the random graph with respect to neighbourhoods around its fixed root vertex. 

Theorem 6.12 (Local convergence of random unlabelled graphs). Suppose that the weighted 
ordinary generating series has2 radius of convergence p > 0, and that 

(6.7) E{z,u) = zZ^setoB') 4^, iCTiz'"), ■ ■ •) 

satisfies E{p + e, (C*)^(p) + e) < oo for some e > 0. Then for any sequence kn = o{y/n) of 
non-negative integers it holds that 

( 6 . 8 ) dTv{UkAC^J,UkAQ)^0, 

and likewise for the graph-metric neighbourhoods Vk„{-). Thus, the infinite random graph C is 
the local weak limit of the random graph C‘f as n becomes large. 

Note that this form of convergence is best-possible, as the diameter of has order ^/n by 
Theorem 6.14, and hence ( 6 . 8 ) fails if the order of kn is comparable to y/n. 

6.4.2. Benjamini-Schramm limit and subgraph count asymptotics. The inhnite ^-enriched tree 
H* from Section 6.2.2 may be interpreted as an inhnite locally hnite random graph C* according 
to the bijection in Section 6.1.2. Theorem 6.8 yields Benjamini-Schramm convergence of the 
random graph Cn- 

Theorem 6.13 (Benjamini-Schramm convergence of random unlabelled graphs). Suppose that 
the weighted ordinary generating series {C*)‘^{z) has radius of convergence p > 0, and that 

E{z,u) = zZ^setoB') 4^, {CTiz‘^), (CTiz^), ■ ■ •) 

satisfies E{p -|- e, (C*)‘^(p) -|- e) < oo for some e > 0. Let v* be a uniformly at random drawn 
vertex of the random graph Cn- Then for any sequence kn = o{y/n) of non-negative integers it 
holds that 

dTY{UkAC^,v*),UkAn)^0, 

and likewise for the graph-metric neighbourhoods Vk„{-). Thus, the infinite random graph C* is 
the Benjamini-Schramm limit of the random graph as n tends to infinity. 

Again this form of convergence is best-possible, as the diameter of Cn has order ^/n by 
Theorem 6.14. Benjamini-Schramm convergent sequences have many nice properties, for example 
we may apply general results by Kurauskas [46, Thm. 2.1]) and Lyons [54, Thm. 3.2] to deduce 
laws of large numbers for subgraph count statistics and spanning tree count statistics. 

6.4.3. Scaling limit and diameter tail-bounds. We apply our results to hrst-passage percolation 
on graphs. Let t > 0 denote a random variable which has hnite exponential moments. Given a 
connected graph G we may consider the hrst-passage percolation metric dFPP on G by assigning 
an independent copy of i to each edge of G, letting for any two vertices x, y the distance dFPp{x, y) 
be given by the minimum of all sums of weights along paths joining x and y. We let Dfpp(-) 
and Hfpp(.) denote the diameter and height with respect to the dppp-distance Theorems 6.9 
and 6.10 and the fact, that the diameter and height of the CRT are related by 

E[D(re)] = ^E[H(re)] = 


readily yield the following result. 
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Theorem 6.14 (First passage percolation random unlabelled rooted graphs). Suppose that the 
weighted ordinary generating series has radius of convergence p > 0, and that the series 

E{Z,U) = zZ(SEToB')«(«>C“"(z2),r‘"(z3), . . .) 

is finite at the point {p + e, C*^{p) + e) for some e > 0. Then there exists a constant a > 0 such 
that 

{Cf, an^^/^dppp) {Te, d%) 

in the Gromov-Hausdorff sense as n = l mod span(w) becomes large. Furthermore, there are 
constants C, c > 0 with 

P(D(C)), (ippp) > x) < C'exp(—cx^/n) 

for all n and x > 0. In particular, the rescaled height and diameter converge in the space Lp for 
all p > 1. We have asymptotically 

A A I 'TTfl 

E[Dfpp(C-)] ~ -E[Hfpp(C-)] ~ —. 

Lemma 6.11 also yields the following result for the size of the largest 2-connected component 
of the random graph Cf. 

Corollary 6.15. There is a constant C > 0 such that the largest block in the random graph 
has size at most Clogn with probability tending to 1 as n becomes large. Likewise, the maximum 
degree admits an 0(log(n)) bound with high probability. 

6.5. Applications to random unlabelled front-rooted fe-dimensional trees. We consider 
the species K, of front-rooted A:-trees and the subclass JC° where the root-front is required to lie in 
a single hedron. Let K„ denote a uniform random unlabelled /C-object with n hedra and likewise 
K° a uniform random /C°-object with n hedra. As discussed in Section 6.1.3, the two species are 
related by the equations 

/C~SETo/C°, A-(SEQ|fc|oSET)(/C°). 

This identifies the random /c-tree K° with the random enriched tree for the special case 
TZ = SEQj^j o SET. The random front-rooted A;-tree K„ may be interpreted as a random 
unordered forest of 7^-enriched trees. We let p denote the radius of convergence of KT{z) = Afi{z). 

6.5.1. Local weak limit. Theorem 6.5 readily yields a local weak limit of the random graph K° 
with respect to neighbourhoods of the root-front, or any fixed vertex of the root-front. The 
limit object is the infinite random /c-tree K° that corresponds to the limit Sym(7^)-enriched tree 
(T{°°),/3(°°)) according to the bijection in Section 6.1.3. 

The random unlabelled front-rooted /c-tree K„ may be viewed as a random unlabelled Gibbs- 
partition. By Theorem 4.1 it follows that K„ exhibits a giant component of size n + Op(l), 
and the small fragments converge in total variation toward a Boltzmann limit R that follows a 
¥ distribution. We let K denote the infinite random /c-tree obtained by identifying the 

OIL -L OA.y ,p 

root-front of K° with the root-front of the front-rooted /c-tree corresponding to R according to 
the bijection in Section 6.1.3. 

Theorem 6.16 (Local convergence of random unlabelled front-rooted /c-trees). For any sequence 
kn = o{y/n) it holds that 

rfTv(Vfc„(Kn), Efc„(K)) 0. 

Thus the infinite random graph K is the local weak limit of the random front-rooted k-tree as 
n becomes large. Here we may interpret 14^(-) as the neighbourhood of the root-front, or of any 
fixed vertex of the root-front. By exchangeability, it does not matter which we choose. 

Note that Theorem 6.16 does not follow directly from the above discussion, as we still need 
to relate the height of vertices in the ^-enriched tree representation with the height in the 
corresponding /c-tree. We provide a detailed justification in Section 7.3. 
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6.5.2. Benjamini-Schramm limit. The infinite ^-enriched tree H* from Section 6.2.2 may be 
interpreted as an infinite k-tree K* according to the bijection in Section 6.1.3. 

As we are going to argue in detail in Section 7.3, a random vertex in lies with high 
probability in the largest /C°-component. Theorem 6.8 together with a large deviation estimate 
yield Benjamini-Schramm convergence of the random k-tree K° toward the random graph K*, 
and hence also Benjamini-Schramm convergence for the largest /C°-component of K„. Moreover, 
within this component, a random vertex is unlikely to lie anywhere near the root. Thus K is also 
the Benjamini-Schramm limit of the random front-rooted k-tree K„. 

Theorem 6.17 (Benjamini-Schramm convergence of random unlabelled front-rooted k-trees). 
Let V* denote a uniformly at random selected vertex of the random unlabelled front-rooted k-tree 
K„. Then for any sequence of positive integers kn = o{y/n) it holds that 

dTy{VkA\<n,V*),Vk„{k^))^0. 

Thus K* is the Benjamini-Schramm limit of Kn as n becomes large. 

6.5.3. Scaling limit. By Theorem 4.1 we know that K„ exhibits a giant /C°-component of size 
n Op{l). Hence in order to establish a scaling limit for Kn it suffices to study the random 
k-tree K° where the root-front lies in a single hedron. In order to establish a scaling limit for K° 
we may not apply Theorem 6.9 directly, as the metric of K° does not ht in the general scheme of 
random metric spaces considered in Section 6.3. Rather than that, we make direct use of the 
size-biased ^-enriched tree of Lemma 6.4 and the results of Lemma 6.11. 

Theorem 6.18. There is a constant > 0 such that 

{Kn,akn-^/^dKj^{Te,dTj 
in the Gromov-Hausdorff sense as n becomes large. 

As a byproduct, we obtain the following properties of the random k-tree K°. 

Lemma 6.19. It holds that 

(K°,afcn-V2dKj A(re,dre). 

Moreover, there are constants C,c > 0 such that for all x > 0 and n > 1 

P(D(K°) > x) < C'exp(—cx^/n). 

6.6. Applications to simply generated Polya trees. Let (Ki)igN be a sequence of non¬ 
negative weights with kq > 0 and k* > 0 for some i >2. Hence k can be seen as a weighting 
on the species SET. Let d denote the greatest common divisor of the set of all indices i with 
Ki > 0. For n = 1 mod d large enough we may draw a random Polya tree Tn having n vertices 
with probability P(t„ = r) proportional to Ouer ^d+{v) ^^7 unlabelled unordered tree r with 

size n. This corresponds to the random unlabelled enriched tree for 77^ = SET'^. We let p 
denote the radius of convergence of the corresponding generating series A(z) := A^{z). 

In the following, we only consider the case where 

(6.9) p>0 and {A[p) + e,A{[p + e)^), ...)< co 

for some e > 0. 

6.6.1. Local weak limit. Let r denote the tree for the special case 77'^ = SET'^. Theorem 6.5 
readily yields the following result. 

Theorem 6.20. If Condition (6.9) is satisfied, then for any sequence kn = o{y/n) of positive 
integers it holds that 

dTv(14„(rn),14„(r)) -^0. 

Thus, T is the local weak limit of Tn as n becomes large. 
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6.6.2. Benjamini-Schramm limit. Let f* denote the pointed plane tree corresponding to the tree 
H* for the case TZ^ = SET'^. Theorem 6.8 applies directly and yields a Benjamini-Schramm 
limit for the simply generated Polya tree r^. 

Theorem 6.21. Suppose that Condition (6.9) holds. Let v* denote a uniformly at random 
selected vertex of the tree Tn- Then for any sequence kn = o(^/n} of positive integers it holds that 

dTy{VkArn,vn,Vk„in)^0. 

In particular, f* is the Benjamini-Schramm limit of Tn as n becomes large. 

6.6.3. Scaling limit and diameter tail-bound. Another application of Theorems 6.9 and 6.10 is 
the following scaling limit with a sharp tail-bound for the diameter. 

Theorem 6.22 (Scaling limits of simply generated Polya trees). If Condition (6.9) is satisfied, 
then there is a constant o > 0 such that 

{Tn, an~^/'^dr„) {Te, drf) 

with respect to the Gromov-Hausdorff metric as becomes large. Moreover, there are constants 
c, C > 0 such that for all x >0 and n it holds that 

P(D(t„) > x) < C'exp(—cx^/n). 

This is a mild extension of results for uniformly drawn Polya trees with n vertices and vertex 
degree restrictions, whose scaling limits were studied by Marckert and Miermont [55], Haas and 
Miermont [39] and Panagiotou and Stufler [59]. 

7. Proofs of the main results 

7.1. Proof of the local convergence of unlabelled enriched trees in Section 6.2. 

Proof of Lemma 6.1. As discussed in Section 3.5.3, the rules of Lemma 3.4 concerning the 
interplay of Boltzmann distributions and operations on species may be used to construct recursive 
samplers, if the concerning isomorphism satisfies the conditions of Theorem 3.5. This is the case 
for isomorphism 

which corresponds to the combinatorial specification y = 'H{X,y) with 'H{X,y) = X ■ IZ'^{y). 
Indeed, it holds that 77(0,0) = 0 • 7^''(0) = 0 and 5277 ( 0 ,0) = 0 • (77.')'^(0) = 0. 

Thus we may apply the product rule and substitution rule of Lemma 3.4 to construct a 
recursive procedure that samples according to the -Boltzmann distribution. The 

result is the procedure described in Lemma 6.1, with one important difference. According to 
Lemma 3.4 we would have to apply Lemma 3.3 for each recursive call to construct an Ati- 
symmetry out of the 7^-symmetry and the attached A-T^-symmetries, and then relabel uniformly 
at random. Instead of doing this for each recursive call, we skip this step and keep track of all the 
77.-symmetries, yielding the Sym(7?.)-enriched tree (T,/3). As discussed at the beginning of the 
Section 6.2.1, we may construct the symmetry that corresponds to (T, /3) in one step by applying 
Lemma 3.3 for each of its vertices, starting with the leaves and working our way upwards. Thus 
(T, /S) corresponds to a symmetry on the vertex set of the plane tree T. Moreover, instead of 
relabelling uniformly at random after each application of Lemma 3.3, we may postpone this step 
and just relabel the symmetry corresponding to (T, P) once uniformly at random. The result 
rZ_ 4 !j^(x) then follows -Boltzmann distribution, meaning that for each n > 0 and 

each symmetry {A, a) G Sym(A 7 e)[n] it holds that 

j.n 

^{TZji^ix) = (A,a)) = uj{A) — Z_a^{x,x^,. . 

By Equation (3.1) we know that 


This completes the proof. 


Za^{x,x‘^,...) =Ati{x). 


□ 
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Proof of Lemma 6.2. As for claim (1), note that the isomorphism — X • TZ'^[Af^ implies 
that 

.. .). 

By assumption, there is an 7^-structure with size zero and one with size at least two such that 
both have a positive K-weight. It follows that there are constants a, 6 > 0 and k >2 with the 
property that for all 0 < x < p it holds that 

Afi{x) > x{a + bAjiix)^). 

This implies that \\m.x^pAfi{x) < oo and hence, by non-negativity of coefficients, A^(p) < oo. 

Claims (2) - (6) sA follow after a moments consideration from the explicit description of the 
sampler in Lemma 6.1. □ 

Proof of Lemma 6. 3. Claims (1) and (2) follow from a general enumeration theorem by Bell, 
Burris and Yeats [12, Thm. 28] which implies the asymptotic behaviour of the coefficients of the 
power series Af^{z). 

Claim (3): The expressions for the moments of ^ and C follow from the equations describing 
the corresponding probability generating functions in Lemma 6.2. In order to verify that E[.^] = 1, 
note that by Pringsheim’s theorem the function Afi{z) cannot be analytically continued in a 
neighbourhood of p, and hence by the implicit function theorem it must hold that the function 

H{z, u) := u — E{z, u) 

satisfies 

iL„(p,A^(p)) = 0. 

In other words, E[^] = Eu{p,A^{p)) = 1. (Compare with the proof of [12, Cor. 12].) 

Claim (4): Let A denote the lattice spanned by all differences x—y of vectors that (^, C) assumes 
with positive probability. We assumed that there is an 7^-structure with size zero and positive 
K-weight. Its automorphism is the empty map with no cycles at all. Hence E((^, C) = (0, 0)) > 0, 
and A is actually generated by the support of (C,C)- 

We assumed further that at least one 7^-structure R with positive k- weight has a non-trivial 
automorphism group. Hence there are positive numbers a*,b* c* such that (^,C) assumes 
(a*, 0) and {b*,c*) with positive probability, as these points correspond the trivial and a non-trivial 
automorphism of R. Hence the lattice A contains three points that do not lie on any straight 
line. Consequently, it has rank 2, that is, A ~ as abelian group. 

Let B G Z^^^ be a basis of the lattice A. As the support of (^, C) is not contained on a straight 
line, it follows that the covariance matrix S is positive-definite. Let denote a family of 

independent copies of (^,C)- By Lemma 6.2 it holds for any positive integer n and i that 

i m 

P(1T1 = n, IT-^l = L) = P(^(^j,0) = (^ - l,n - £), > m for all m < L). 

i=l i=l 

Using rotational symmetry and the cycle lemma 5.4, it follows that 

1 ^ 

(7.1) P(iri = n, \Tf\ =l) = Q = (^ - 1, n - 1)). 

i=l 

We know by Claim (1) that if n is larger than some fixed constant, then EOT] = n) > 0 if and 
only if n — 1 is divisible by span(w). We shall check below that for each such n 

(7.2) e K] = n + U 

with d = j det Bj/span(w). For now, let us assume that (7.2) holds. Let M > 0 be a fixed 
constant. Then it holds uniformly for all t = {1 + E[(^])n + Xy/n with jxj < M that 

^-n/Vl + EK] 

and 

r 1/2 ((^ _ 1 , n - t’) - 7E[(^, C)]) ~ (0, -x(l + E[C])=i/2). 
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The central local limit theorem given in Lemma 5.1 yields that 


(7.3) 


i=l 


(l+E[C])|detB| 
n‘^\/27r det S) 2 cj 2 


with (T^ = det S/(V[C](1 + E[((])^). By Claim (3) we know that 


P(|r| = n) ~ 

Using Equation (7.1) it follows that 

ni7'/i = £) 


spa 


/ X — 3/2 /l + E[(^] 
n(w)n \l -pV-. 


d 

^ — exp(-;^). 
ay2TTn 2cr^ 


The central limit theorem now follows from Equation (7.2), as for any fixed a < b 




d 


n 




1 

^;^exp(-;^)dx, 
aV2'jr 2cj^ 


with the sum index x ranging over [a, b] n ((1 — + Zd/ \/n). 

It remains to verify Equation (7.2), which requires careful reasoning, as we have to relate 
span(w) with the involved lattice. In order to simplify our calculations, we would like to pick a 
’’nice” basis B of A. Note that it does not matter for (7.2) which basis of A we choose, as for any 
two bases Bi and B 2 there is a matrix M G GL 2 (Z) with Bi = MB 2 , and as detM G {1, —1} 
it follows that | detBij = | detB 2 |. The inconvenient part is that, contrary to vector spaces, not 
every linear independent subset of a lattice may be extended to a basis. However, a classical 
algebraic result states that for any free Z-module M with rank r{M) and for any submodule 
N C M with rank r{N) there is a Z-basis ui,..., Vr(M) of M and integers Ai,..., Xr(N) such 
that AiUi,..., AfcUfc is a basis of N. See for example Roman’s book [62, Thm. 6.7], which states 
this in the more general context of modules over principal ideal domains. If B = (bi,b 2 ) is such 
a basis of A for the submodule N := Z(a*,0)T C A (recall that we defined a* at the beginning of 
the proof, when we showed that A has rank 2), then there is an integer A with Abi G Z(a*, 0 )'''. 
But this implies that the second coordinate of bi must be zero, and hence 

P.4) B^(“ 0 


with a, 6 , c > 0 is an upper-triangular matrix. For any i, n it holds that (f—I,(n —I) — (I"—I))t gA 
if and only if (£ — I, n — I)''' G A' := CZ^ with 

/I fa b\ _ fa b \ 

V-I 1 ) ^0 cj Va b + cj- 


We may easily calculate that 

(£ — I, n — I) G CZ^ if and only if n — I G Z gcd(a, 6 -|-c), £ G re-|-Zac/gcd(a, 6 -|-c). 


Indeed, it necessarily holds that re — I G Za -|- Z (6 -|- c) = Zgcd(a, 6 -|- c). Conversely, if 
re — I = Aqo -|- fiob, then any pair (A, /i) G Z^ satisfies Xa + fib = n — 1 precisely if there is an 
arbitrary integer t with A — Aq = t{b + c)/gcd(a, b + c) and fi — fio = ta/ gcd(a, b -1- c). Hence 
(£ — I, re — I)T = C(A, /r)T reduces to £ = n + act/ gcd(a, b + c) with t G Z. 

Since det(B) = ac, it remains to check that span(w) = gcd(a, b + c). That is, we show that 
span(w) is the span of the projection of A' on the y-axis. The support S' of (jT'^j — I, |T| — I) 
is a subset of the lattice A'. Hence span(w), which is the span of the projection of S to the 
y-axis, is a multiple of gcd(a, b + c). In order to show equality we need to verify that indeed two 
successive multiples of gcd(a, b + c) exist that |T| — I assumes with positive probability. We may 
do this probabilistically. Let C > 0 be a given constant. By Equation 7.1 and the central local 
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limit theorem given in Lemma 5.1 it follows that there is a positive integer L such that for all 
i > L and all n with (/ — 1, n — 1 )t G A' and 

(7.5) C)])II < c 

it holds that P(|T| = n, IT-^I = £) > 0. There is a constant c > 0 such that for every n at least 
c^/n consecutive integers £ > L exist that satisfy Inequality (7.5). If n — 1 is additionally a 
constant multiple of gcd(a, 6 + c), then the intersection T^ of the lattice A' and the affine space 
(0, n — 1)T + Z(1,0) is non-empty, and hence an affine subspace of rank one, whose span does not 
depend on n. In particular, r„ hits the cone (7.5) about a constant fraction of ^/n many times. 
This proves that span(w) = gcd(a, b + c) and concludes the proof of Claim (4). 

□ 


Proof of Lemma 6.4- The task is to provide a sampler for the species Aff using the isomorphism 

iW 


Jtg = {X ■ 

= Afi and for £ > 1 it holds 


Note that for £ = 0 we have A^ = A^ and for £ > 1 it holds that 


Thus for £ > 1 we may apply the product rule and substitution rule of Lemma 3.4 to construct 
a procedure that samples according to the -Boltzmann distribution, employing a 

Pg^m(_ 4 (^-i)) -distributed symmetry, which we may sample by a recursive call to the sampler 
for I — 1. 

The result is essentially the procedure described in Lemma 6.4, but we make some modifications. 
According to Lemma 3.4 we would have to apply Lemma 3.3 to construct an ^ 7 ^-symmetry out of 
the TT-Csymmetry, the attached .AT^-symmetries, and the a!^ ^^-symmetry, and relabel uniformly 
at random afterwards. Instead of doing this for each call, we skip this step and keep track of all 
the 77.'-symmetries. Also, instead of taking A-T^-symmetries directly, we use the Sym(77)-enriched 
tree (T,/3) from Lemma 6.4. This yields the Sym(77)-enriched tree 

As discussed at the beginning of the Section 6 . 2 . 1 , we may construct the symmetry that 
corresponds to in one step by applying Lemma 3.3 for each of its vertices, starting 

with the leaves and working our way upwards. If we additionally relabel uniformly at random, 
the resulting symmetry TZ (e) follows a P . (t) . . -distribution. 

We defined A^ as a subspecies of the species of pointed 77-enriched trees. In particular, for all 

n > 0 and any symmetry ((A, r),a) G Sym(A..^^) we have that (A, a) is an A^-symmetry and r 
is a fixpoint of the antomorphism a. It follows from the definition of the Boltzmann distributions 
and 

A^^{x) = {x{WY o A^(x))^A^(x) 

that 

■'(P\ 1 n 

IP’(r^_ 4 w(x) = ((A,r),c 7 )) = w(A)—Ay(a;)“^ = {x{n'Y o A%.{x))-^P{TZj^<^{x) = {A,a)). 

□ 


Proof of Lemma 6.6. Claim (1)\ By Lemma 6.4, the 77-symmetry (R, A) of G follows, up to 
relabeling, a weighted Polya-Boltzmann distribution for the species {TZ'Y with parameter Af^{p). 
It makes no difference whether we distinguish the vertex corresponding to the *-hxpoint, or a 
uniformly at random drawn hxpoint, as the results are identically distribnted. By the discussion 
in Section 3.2 it holds that 

d 

Z{Tl')'^{si,S2, ■ ■ ■) = -g^Z'ji'^Yl, S2, . . .). 

Hence, for any 77-symmetry {R, a) with a marked fixpoint n, the probability that (R, a) = {R, a) 
and that precisely the fixpoint u gets marked, is simply given by the probability that a Polya- 
Boltzmann distributed 77-symmetry with parameter Af^{p) equals {R,a). Consequently, for any 
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^-object G with a marked fixpoint u the probability that G assumes G and that precisely the 
vertex u is marked is given by P(G = G). 

Claim (2): This is a reformulation of Lemma 6.4 in terms of ^-enriched trees. 

Claim (3): The event = (r, 7 )^^^ corresponds to precisely Lr{k) different 

outcomes of the first k — 1 levels, depending on which leaf of the spine of the fixpoint tree 
is supposed to pass through. By Claim (1), each of these events is equally likely with 
probability = ^i)- 

Claim (4)'- The k + 1-th level of interpreted as ^-enriched tree is obtained 

by taking for each fixpoint of the k-th. level an independent copy of G, except for the unique 
distinguished fixpoint, which receives an independent copy of G. This yields 

E[Lk{T)] = (E[Lfc_i(r)] - l)E[e] + E[^l = ... = k{E[i] - 1) + 1, 

E[\t^] = E[Lo(r) + ... + Lkir)] = k{k + l)(E[e1 - l )/2 + k + l, 

E[lI{t)] = {E[Lk] - 1)E[C] +E[C] = ... = k{E[i] - 1)E[C] +E[C], 

k 

E[hS{t) - |r[*^]|E[C]] = j;(E[Lf (r)] - E[L,(r)]E[C]) = {k + l)(E[a - E[C]). 

i=0 

For the variance, note that 

Y[H^{t) - Irt'^llEK]] = E[{Do + ... D^f)] 

with 

A :=Lf(r)-A(r)E[C]-E[C]+E[C] 
satisfying E[A | Ei(r) = P\ = ^ for all i. In particular, for all i < j, 

E[D,D, I ( 7 (u))„g,[.-i,)] = D,E[D, \ L,(r)] = 0. 

It also holds that 

E[A"| A(r)]=A(r)V[C]+V[C], 

and consequently 

k k 

V[Af (r) - |rW|E[C]] = Y.E[Df] = j;(E[A(r)]V[C] + V[C]) 

2=0 2=0 

= k{k + 1)(E[|] - l)V[C]/2 + {k + 1)(V[C] + V[C]). 

□ 

Proof of Theorem 6.5. Let £ denote the countably infinite set of all ^-enriched plane trees and 
set 

A = I A G T}. 

We have to show that 

(7.6) lim sup |E((7;, e 7 ^) _ ip(( 7 -(oo)^ ^(oo))<fc„> ^ ^ 

Recall that for any ^-enriched tree (r, (Gt-(u))^) with Gr{v) = {(3riv), friv), Ft{v)) and any 
integer A: > 0 we set 

k 

= |{^^ e r I h^(u) = A:}|, Lf{r)= ^ 1^(1^)!, and (t) = ^ if (t). 

vGt i=Q 

hT{v)=k 

By assumption, there is a sequence tn ^ 0 with kn = y/ntn- We may without loss of generality 
assume that kn tends to infinity. For any C > 0 and all k and n we define with foresight the set 
£c,k,n of all (r, 7 ) E A satisfying p(( 7 '(“),/ 3 (~))<*^> = (t, 7 )) > 0 and 

1 < A(t) < G{ntn)^/^, |iL^_^(r) - |rI^“^'|E[C]| < < ntn. 
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Using Markov’s and Chebyshev’s inequalities and the expressions of the moments in Lemma 6.6, 
it follows that there is a constant C such that 

(7.7) hm ip((7-(~),/3(-))<^"> e = I- 

Hence, if we verify (7.6) with the index % ranging only over all subsets of £c,kn,m then (7.7) 
implies that 

(7.8) hm F{{Tn, /3n)<""> ^ £c,k^,n) = 1 

n^oo 

and consequently (7.6) already holds with the index ranging over all subsets of £k„. So the only 
thing that is left to show is that (7.6) holds for 77 C £c,kn,n- That is, we have to verify that for 
any e > 0 it holds for large enough n that 

(7.9) sup |P((7;,/3n)<^"> g^) _P(( 7 -(~),/ 3 M)<fcn> ^ 

'HG£c,k„,n 

Let B,5],|U, (T and d = | det B|/span(w) be as in Lemma 6.3. We have shown in this Lemma 
that, as n = 1 mod span(w) tends to infinity, 

(7.10) VnlP(|7;7| =£) ~-^exp(-^) 
uniformly for all bounded xg with 

f := /xn + xg^/n E n + dZ. 


Moreover, 



d 


A7(0,ct2) 


and for any ei > 0 there is a constant M > 0 such that for all n 

T(|r/|^4)<ei with 4:=(n + dZ)n[n/(l + E[C])-M\/ii,n/(l + E[C]) + M\/^]. 


Hence the expression in (7.9) may be bounded by 


(7.11) 


ei + sup 

(T,7)e£c,fc„.n 


IP((rn,/ln)<^-> = (r,7),|7;^| E4) 
E((r('^),/3(“) )<*="> = (r,7)) 


Let (r, 7) E £c,k„,n and let Gi = {Ri, fi, Ft), 1 < i < t he the depth-first-search ordered list of 
its ^-objects. We set {h,H) := Yli=ii\fi\^ 1^)1)- Moreover, let = (Ri,fj, Fj) denote a family of 
independent copies of the random ^-enriched tree G. For each ^-enriched plane tree G set ttq = 
E(G = G). By Lemma 6.2 it follows that for all £ the probability E((T, /3)<*’"> = (r, 7), \T^\ = i) 
is given by 


i 

ttGi • • • 7rGtE( (|fi|,|Fi|) 


m 

Y, f. > m — h for all 1 < m < £). 


Using rotational symmetry and the Cycle Lemma 5.4, this may be simplified further to 

I 

vtgi • • • ^ (|f,|, |Fi|) = (i-l-h,n-i- H)). 

i=t+l 


The tree r has precisely h — t + 1 many leaves with height k and thus Lemma 6.6 implies that 

p((^{oo), /3(oo))<fc„> ^ t + 1). 

It holds uniformly for (r, 7) E £c,kn,n and i = fin + xg^/n E In that 

r V2t)E[(e, C)] I ~ (0, -x,(l + EK])^'/^) 
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as n becomes large. Hence, as i — t ^ i, we may apply the bivariate Central Local Limit 
Theorem 5.1 to obtain 


1 


e 

i=t+l 


By Lemma 6.3 we know that 


{£-1 


h, n 


£-H))^ 


(l+E[C])|detB| 
n^\/27r det S ^ 2 (T^ 


P(|r| =n) 


spa 


, ^ — 3/2 /l + E[(^] 


It follows that uniformly for all (r, 7 ) G £c,k. 

= (r,7),|7;^| e/n) d 


P((r(~),/3(“))<^"> = (r, 7 )) 


E 

i&In 


1 f \ 


r-M 


1 


l-M cTv2vr 2cr^ 


Taking ei = e/2 and M sufficiently large it follows that the bound (7.11) is smaller than e for 
sufficiently large n. This completes the proof. □ 


Proof of Lemma 6.7. Claims (1) and (2) are straight-forward. For Claim (3), note that the 
event means that and that uq lies at one of the p(H) many locations, 

and that u* lies in precisely the location in {uq} U F{uq) that corresponds to tt, with F{uo) 
denoting the forest of non-fixpoints of the ^-object corresponding to uq. Let Gi,... ,Gt denote 
the depth-first-search ordered list of the ^-objects Gi = (Si, fi, Fi) of Hk and let ..., Gi^ be 
the segment that corresponds to Hq. For any ^-object G = (S,f,F) with a marked fixpoint v 
from /, the probability for G = (S,f,F) to assume G and that a uniformly at random drawn 
hxpoint of / equals v is given by 

p(G = G)|/ri =p(G = G). 

Likewise, if we distinguish a vertex v from {*} U F, then the probability for G = (S,f, F) to 
assume G and that a uniformly at random drawn vertex from {*} U F equals v is given by 

P(G = G)(l + |F|)-^ = P(G = G)(l + E[C])-\ 


Thus 

P((FI[,],u*) = (H,u)) = p(H)P(G = G4)(1 + E[C])-' = ^*)- 

For Claim (f), let us start with the fixpoints. By assumption, there is a sequence tn ^ 0 such 
that kn = tny/n — )• 00. For any integer m > 0 let Sm denote the sum of m independent copies of 
the size of the ^-Galton-Watson tree T-^. The number of fixpoints #/Flfc is given by the sum of 

#,H„Sl+S|,|. 

and the independent differences 

#/Hi — 

Consequently, 

#/Hfc = 1 -I- SiMk 

with Mfc = |f| -P - 1) and denoting a family of independent copies of |f|. By a 

general result for the size of Galton-Watson forests, there is a constant G > 0 such that 

E(<S'm > x) < Gmx~^l‘^ 

for all m and x > 0. See Devroye and Janson [24, Lem. 2.3] and Janson [42, Lem. 2.1]. As 
and |f| have finite first moments, it follows that 

P(5m, >x)< GE[Mfc]x-i/2 = c(E[|f|] + k(E[i] - 

Setting X = ntn — 1 and A: = fenj h follows that < ntn with probability tending to one as 

n becomes large. 
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For the second statement, let Rm denote the sum of m independent copies of | T| — (1+E [^]) | \ ■ 

The difference #Hfc — (l + E[C])#jHfc is given by the sum of 

#Ho - (1 + E[C])#/Ho = |F| - E[C] + i?|f| 
and the independent differences 

#H, - #H,_i - (1 + E[C])(#/H, - |F| - E[C] + i = l...k. 

Consequently, 

k 

(7.12) #H, - (1 + E[C])#/Hfc ^=Vl - E[C] + - E[C]) + Rm, 


with Q denoting independent copies of |F|. Markov’s inequality implies that for k = kn 

(7.13) ||F|-E[C] +j;(Ci-E[C])| < V^/2 

i=l 

with probability tending to one as n becomes large. 

It remains to show that \Rm,^ \ < y/ntnl"^ with probability tending to one. If ^ and ( were 
independent, then this would be rather simple. But, as this is not necessarily the case, it requires 
a bit of effort. We may write 

OO 

(7.14) |r|-(l + E[C])|r^|= with D, = Lf(T^)-U(T^)Eia 

i=0 

The sum is finite, as A = 0 for i > H(A). Since 

E(H(r^) > h)^2/(Vl^]h) 


as h becomes large, it follows that the probability for the maximum height of m independent 
copies of A to be less than A/vA is given by 


(7.15) 


(1 + o(l))2\/fn\ 

W\K ) 


with the o(l) term not depending on m. As E[Mfc„] ~ A;n(E[|] — 1) and V[Mfc„] 
follows by Chebyshev’s inequality that 

|Mfc„ - kn\ < kl/^ 


fcnV[^], it 


with probability tending to one. Hence Expression (7.15) implies that the probability for the 
maximum height of independent copies of A to be smaller than A/vA tends to one as n 
becomes large. Equation (7.14) hence implies for all x that 


(7.16) F{\Rm,^ I > x) < o(l) +F{\Ui + ... + UkJ >x) 

with the Uj denoting independent copies of Di. We aim to apply Chebyshev’s inequality 

again, and hence compute the expected value and variance of the Uj. It holds for all i and i that 

£ 

E[A I A (A) =l]= E[^ 0 - ^E[C]] = 0 , 

i=i 


with (Cj)j denoting a family of independent copies of C- Consequently, for all k 


k k k 

E[^ A] = 0 and V[y] A] = E[A^] + 2 ^ E[A A']- 

i=0 i=0 i=0 

For all 0 < i < j it holds that 

e[aa I ((A)''’, (/3(A).6(r/)b-i] = AE[a I AA^)] = 0, 
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because for all i 


Clearly it also holds that 
Hence, 


E[D, I Lj{T^) =i]= Ci - ffi[C]] = 0. 

r=l 

E[D] I Lj{T^) =i]= £V[C]. 


v^A] =v[c]E[^L,(r^)] = [k + mc]. 


i=0 


i=0 


Setting k = [kn/'/t^\, we may thus apply Chebyshev’s inequality to (7.16) and obtain 

> Vntnl2) < o(l) + — = o(l). 

Together with (7.12) and (7.13) this implies that 

- (1 +IE[C])#/Hfc„| < Vrvtn 

with probability tending to one as n becomes large. □ 

Proof of Theorem 6.8. For any k, let A denote the set of pairs (H,u) where H = (A)o<j<fc is a 
representation of a ^-enriched tree as sequence of increasing enriched fringe subtrees, and u is 
either the root of Hq or an element of the non-fixpoints of the ^-object corresponding to the 
root of Hq. We need to show that 

(7.17) lim sup |P((Hf,^],u*)G?7)-P((H[fc„],A)G?7)|=0. 

By assumption, there is a sequence tn —^ 0 such that kn = y/ntn. We define the subset 

e £k I iffHk < ntn, \ ffHk - (1 + E[C])#/77fc| < Vr^tn}- 

Lemma 6.7 implies that (H[^^],u*) lies in £kn,n with probability tending to one as n becomes 
large. Hence, if we verify Equation (7.17) with the index H only ranging over the subsets of 
£kn,n: then it follows that (H|^ also lies with probability tending to one in £k„,n- But this 
already verifies (7.17) when H ranges over all subsets of and we are done. 

So it remains to show that for any e > 0 it holds for sufficiently large n that 


(7.18) 


sup |P((Hf,^],u*) en)- P((H[fc„],u*) E 77)1 < e. 


In order to show this, we first exert some control over the number of fixpoints in {Tn,Pn)- Let 
B, S,/i and a be as in Lemma 6.3. If follows from this Lemma that, as n = 1 mod span(w) 
tends to inhnity, 

V^E{\Tj\=£) 

uniformly for all bounded E M with 


d / xj . 


Furthermore, 


£ := /rn + x^y/n E n + dZ. 


n 


and for any ei > 0 there is a constant M > 0 such that for all n 

E{\Tj\^ In) <€1 with In:={n + dZ)n[n/il + E[C])-MV^,n/il + E[C]) + My/fi]. 
Hence we may bound the expression in (7.18) by 


(7.19) 


ei + sup 
(H,n)ec£fc„ 


P((H 


[kn 


= (H,u),|rr/| E4 


P((H[fc„],n*) = (H,u)) 


- 1 
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Throughout the rest of the proof, we set k = kn- Let (H,u) G Sk^n with H = (-ffi)o<i<fc be 
given. We set L = Lk{Hk) + L^{Hk) and let Fq denote the size of the forest of the ^-object 
corresponding to the root of Hq. Let A denote a ^-enriched tree with a total number of vertices 
= n. Given {Tn, Pn) = A, the vertex v* is drawn uniformly at random from A. Given 
{l~n, Pn) = A and = Hk, the vertex v* is drawn uniformly at random from L possible locations. 
Out of these, exactly p(H)(l + Fq) many correspond to the event H^j = H, since the number 
p(H) defined in Lemma 6.7 counts the number of fixpoints v at height k in Hk with the property, 
that the extended enriched fringe subtree representation with respect to v is identical to H. 
Moreover, given additionally Hj^j = H, there is precisely one out of 1 + Tq possible locations such 
that V* = u. Hence 

(7.20) P(Hf,] = H I (Tn, fin) = A)= P(H^ = Hk \ Hn) = A)p{U)/L. 

There is a 1 to L correspondence between the fixpoints u of H with f{A,v) = Hk, and the 
possible locations for v* such that = Hk- Hence 

(7.21) P(H^ = Hk I (Tn, Pn) = A) =E[Y^ l/((r„ A),.)=h, I {Tn, Pn) = A]L/n 

v&tJ 

Let {Gi,... ,Gt) and G”, i = 1,..., \ Tn \ denote the depth-first-search ordered lists of ^-objects of 
the enriched trees Hk and tJ■ The occurrences of (Gi,..., Gt) as substrings of (G”)i correspond 
precisely to the vertices of tJ) where the fringe-subtree equals Hk- As Equations (7.20) and 

(7.21) hold uniformly for all ^-enriched trees with n vertices, it follows that 

l 

(7.22) P(Hf,] = H, \Tj\ eIn) = Y. E[ J", \TJ\ = £]p{U)/n 

eein i=i 

with J" denoting the indicator variable for the event (G”,..., Gj^^_^) = (Gi,..., Gt)- Here we 
set G” := G”_^ whenever i > i. Note that Jj = 0 whenever j -|- t — 1 > .^, as then the sequence 
does not correspond to any ^-enriched tree at all. Hence the sum of the Jj really 
counts the number of occurrences of (Gi)i<j<i in (Gi)i<j<£. Recall that by Lemma 6.2 there is a 
natural coupling of the unconditioned ^-enriched tree (T, P) with a family (Gi)jgf^ of independent 
copies Gj = (Si,fj, Fj) of the random ^-object G, conditioned on the event that there is an 
initial segment of (Gi)igr!} that corresponds to an ^-enriched tree. Let (Jj)j=i ^ denote the 
unconditioned pendants of J" for the sequence (Gi)i<j<£. Lemma 6.2 implies that 

I i I m 

Jh\'^ n\ =^]= n\Ti\ = ^)K[J] Jj I |F,|) = (£-1, n-^), |fi| > m for all m < £]. 

j = l j = l 2=1 2=1 

The sum Jj is invariant under cyclic permutations of the list (Gi,..., G^). Hence the Gycle 
Lemma 5.4 yields 

t t l 

I'T'/I = ^] = = ^)eE I |Fd) = (^ - 1, n - i)]. 

j=l j=l i=l 

Conditioned on this simpler event, the Jj are all identically distributed. Hence 

f £ 

®E \'^n\ = ^] = mrj\=miJi I iFii) = {i-i,n-e)]. 

j=l i=l 

Setting ttg = P(G = G) for all ^-objects G, it follows that 
£ £ 

E[Ji, |Fd) = (£ - 1, n - £)] = ttgi • • • vrG,P( (|fd, |Fi|) = {i-\-t,n- #Hk)). 

2 = 1 2=£+1 

It holds uniformly for (H, u) G £kn,n and i = pn + X£^/n G In that 

r V2 #Hk) -{i- t)E[(^, C)]| ~ (0, -xpi + E[C])3/2) 
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as n becomes large. Hence, as i ■ 
theorem 5.1 to obtain 

i=t+\ 


t ^ i, we may apply the multivariate central local limit 






Likewise, the probability lP(X]i=i(|fi|, |Fi|) = {i — l,n — i)) has precisely the same asymptotic 
order. It follows from Equation (7.22) that 


Since Ijn 
(7.23) 


P(Hf,] = H, \Ti\ G In) ~ p(H)7rG, • • • ttg, ^P(|r/| = i). 

i&In 

(1 +E[C])“^ and F{\tJ)\ ^ In) < ei h follows that uniformly for (H, u) G £k„,n 

P(Hp^] = H, \rj\ G In) 
p(H)'(l + ]E[C])-^7rGi •••VTGt ^ 


for n large enough. Setting ei = e/2, Inequality (7.23) and Lemma 6.7 imply that the bound in 
(7.19) is smaller than e for large enough n. This verifies (7.18) and hence the proof is complete. □ 


7.2. Proofs of the scaling limits and diameter tail bounds in Section 6.3. 

Proof of Lemma 6.11. Claim (I): Let x > 0 be arbitrary and let £ denote the event that there 
is a vertex v G with \f{v)\ + |E(u)| > x. It follows from Claim (1) that 

F{\T\ =n) = [z-]Af,ipz)/Af,ip) = ©(n'^/^) 

and hence 

p(T I \T\ =n) = 0{n^/^)F{£,\T\ = n). 

By Lemma 6.2, the probability P(T, \T\ = n) is given by 

l i m 

F{3i < n : max(^i + (i,... Ai + Ce) > x,^Ci = ^ - + Ci) = n,\fm < i : > m), 

2 = 1 2=1 2=1 

with (^j, Ci)ieN denoting a list of independent copies of (^, C)- We are not interested in precise 
asymptotics here and hence bound this very roughly by 

P(T, IT] = n) < P(max(^i + Ci, • • •, + Cn) > a;) < nF{( + C>x). 

As (^, C) has finite exponential moments, it follows that 

P(T, \T\ = n) < ex.p{—cn) 

for some constants C, c > 0 that do not depend on n. 

Claim (2): We may form a random metric space Y by constructing a metric dy on the vertex 
set of r5(p) by patching together independent copies of the metrics 6ji just as in the construction 
of the metric space Y„. Hence Y„ is distributed like the space Y conditioned on having size 
n. For any vertex v of the fixpoint tree let denote the dy-diameter of the subspace 
{u} U F{v) C Y. Given /i > 0 let £' denote the event that Dy > h for at least one vertex v G . 
Using Claim /!/ it follows that 

(*) ¥{£' I |Y| = n) = C>(n3/2)P(T', |Y| = n) = 0{n^/^)F{Do > h) 

with o denoting the root of the fixpoint tree . 

By assumption, for any vertex u G E(o), the distance dy{o,u) is bounded by the sum 
of d^(e) many independent copies of a real-valued random variable x ^ 0 having finite 
exponential moments, with the sum index e ranging over all ancestors of the vertex u in the tree 
T. Clearly we have that 


'^d^e) < |F(o)| 
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for all u G F{o). Since Do < 2sup„g^(o) dy{o,u), it follows that 

OO 

nOo >h)<J2 IP(l^(o)l = k)kF{xi + ... + Xk> h/2) 

k=0 

with (xi)*eN a family independent copies of y. Moreover, 

P(|F(o)|=fc)=P(C = fc) = 0(7t) 

for some constant 0 < 71 < 1. By the deviation inequality given in Lemma 5.3 it follows that 
there are constants a, 6 > 0 such that 

7fP(xi + ... + Xk> h/2) < 

for all k and h. Hence 

noo >h) = 0(72^) 

for some constant 0 < 72 < 1. Hence Equality (*) implies that 

F{S' I |Y| = n) = 0(n^l'^)x2 

and we are done. 

Claim (3y. Set t := y^(l + E[C])(t/ 2 with = V[.^]. Let gr : K.* —>■ M denote a Lipschitz- 
continuous, bounded function defined on the space of isometry classes of pointed compact metric 
spaces. Note that conditioned on having size t is distributed like conditioned on having 
size 1. Hence it is distributed like a ^-Galton-Watson tree conditioned on having ^ vertices, 
which we denote by T/- It follows from claim ii) that 

E[g{tTj/v^)] = 0 ( 1 ) +Y,n9{tv/v^)n\Tk\=i) 

I 

with the sum index I ranging over all integers £ = 1 mod span(w) contained in the interval 
(1 ± n~^) • Since g is Lipschitz-continuous, we have that 

|E[5(t7^7V^)] -E[5(a7^7(2v/£))]| < a„,,E[D(7^')/7^] 

for some constants with sup£(a„ 7 ) —)■ 0 as n = 1 mod span(w) tends to infinity. Moreover, 
the average rescaled diameter E[D(7^')/\/7] converges as t becomes large to a multiple of the 
expected diameter of the CRT Te- In particular, it is a bounded sequence. Since E[gr(^^)] 

converges to E[g'( 7 ^)] as ^ becomes large, it follows that E[ 5 (^^)] converges to E[g'( 7 ^)] as n 
becomes large. This concludes the proof. □ 

Proof of Theorem 6.9. By Lemma 6.11 it follows that with high probability all vertices v G Tn 
have the property that the dY^-diameter of the subspace {u} U F{v) is at most 0(log(n)). 
This implies that the Gromov-Hausdorff distance between the metric spaces (Y„, dY„/\/n) 
and {Filjdy^/y/n) converges in probability to zero. Moreover, by Lemma 6.11 we know that 
{Tn ,cd.-j-f /y/n) with c = Y^(l + E[C])V[^]/2 converges weakly to the CRT Te- It remains to show 

that there is a constant c' such that the Gromov-Hausdorff distance between {TI,dy^/^/n) and 
{Til,c!d.-^f /y/n) converges in probability to zero. 

' n 

We define the random number 7 as follows. Ghoose a random T^'-gymmetry (R,ct) from 
Ufc>o Syni(' 1 ^ 0 [^] with probability proportional to its weight 

and let 7 denote the dp-distance of the two distinct ^-labels. Note that by our assumptions 
on the cycle index sum Z-jiii. we have that |R| has finite exponential moments. Moreover, the 
diameter of the metric dp is bounded by |R| many independent copies of a real-valued random 
variable x ^ 0 with finite exponential moments. Hence 7 has finite exponential moments. We 
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are going to show that the Gromov-Hausdorff distance of {Tji,dY /-/n) and {TJ,'E[r]]d^f /y/n) 

' n 

converges in probability to zero. By the discussion in the preceding paragraph this implies that 

V(l + E[C])V[g] ,, d 
2E[r]]^ 

and we are done. 

Let s > 1 and t > 0 be arbitrary constants and set Sn = log(n)^ and = nh Let e > 0 be 
given and let £i denote the event that there exists a fixpoint v G tI and an ancestor u oi v with 
the property that 

d^f{u,v)>Sn and dY^{u,v) ^ {1 ± e)E[r]]dT^{u,v). 

' n 

Likewise, let £2 denote the event that there exists a vertex v and an ancestor u of u with 

dT„{u,v)<Sn and dy^{u,v)>tn. 

We are going to show that with high probability none of the events £i and £2 takes place 
This suffices to show the claim: Take s = 2 and t = 1/4 and suppose that the complementary 
events £f and £2 hold. Given vertices a 7 ^ 6 in the tree 7^ let x denote their lowest common 
ancestor. If a: G {o, b} then we have 

(a, &) = dY„ ia,x) + dY^{b,x). 

If X 7 ^ a, b, then let o' denote the offspring of x that lies on the 7/^-path joining a and x and 
likewise b' the offspring of x lying on the path joining x and b. Hence we have that 

dY„{(^,b) = dY^{a,x) +dY„{b,x) + R with R = dY^{a,a ) - dy^ia',x) - dy^ib'^x). 

By property £2 and the triangle inequality it follows that \R\ = —R < Thus, regardless 

whether x G {o, b}, it holds that 

dyAa^b) = dy„{a,x) + dyAb,x) + 

Moreover, if d^f{a,x) > log(n)^, then it follows by property that 

' n 

dy„{a,x) G {l±e)E[r]]d^f{a,x). 

Otherwise, if d^f{a,x) < log(n)^ then it follows by property that dy^{a,x) < and thus 

\dy„{a,x) - E[r]]d^f (a, x)\ < 

for a fixed constant C that does not depend on n or the points a and x. It follows that 
\dyA(^A)/An-E[ri\d^f{a,b)/An\ < eD(7;^)/Vn + o(l), 
with T){Til) denoting the diameter. Thus 

dGH(Y„,E[r?]r/) < eD(r/)/^/n + o(l) 

holds with high probability. Since we may choose e arbitrarily small, and D(7/^)/y'n converges 
in distribution (to a multiple of the diameter of the GRT), it follows that dGH(Yn, E[? 7 ] 7 ^^) —)• 0 
in probability and we are done. 

For each finite subset [/ G N and each 7^-structure R G R^U] let (5/j)ieNo be a family of 
independent copies of the metric 5r. Given a M-T^-symmetry S = ((T,a),a) with label set [k] for 
some k > 0 we may form the family {5^{v))y^T of random metrics by traversing bijectively the 
vertices of T in ascending order I, 2,... A: and assigning to each vertex v the ’’leftmost” unused 
copy from the list • • •)• The metrics can be patched together to a metric d^ on the 

vertex set [k] of the tree T just as described in Section 6.3. 

We may assume that all random variables considered so far are defined on the same probability 
space and that the metric dy^ of coincides with the metric with denoting the sampler 

TZ_a^{p) conditioned on having size n. Given the family let R C U^=oSym(^7^)W 

denote the hnite set symmetries of size n such that the event £i takes place if and only if Z„ G LA. 
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By the definition of the event for any symmetry S = ((T, a), a) € Ti we may choose a fixpoint 
Vs of a having the property that there exists an ancestor u in the tree T with 

dT{u,vs)>Sn and {u,vs) ^ e)K[7]]dTiu,v). 

Let is denote the height hT{vs)- Note that since vs is a fixpoint, the tupel {T,a,vs,cr) is 
a ^^^^-symmetry. By Lemma 6.3 the probability for the sampler rZ_4^(/9) to have size n is 

0(n“^/^) and we have that pTZ' o A^{p) = 1. Hence Equation (6.5) implies that the conditional 
distribution of the event £i given equals 

P(Z„ = 5 I { 5 \) r ,) = 0(n3/2) P(rZ_^(.,)(p) = (5,U5) | { 5 \) r ,). 

S&H S&H ^ 


Let vq, ... ,vi denote the spine of TZ (t) (p), that is, vi is the outer root, vq is the inner root, and 

(uo,..., Vi) is the directed path connecting the roots. It follows that the probability for the event 
£i is bounded by 


l-Sn 




(p) 


n Avk,Vi) {l±e)¥\p\{i-k)) 


£=1 fc=0 


rz (,)(p) 

But the d^ V. / -distance between spine vertices Vi and Vj is distributed like the sum 
r]i + ... + P\i-j\ of independent copies {pi)i of p. We know that p has finite exponential moments 
and hence by the deviation inequality in Lemma 5.3 the bound (*) converges to zero as n = 1 
mod span(w) tends to infinity. Thus with high probability £i does not hold. By the same 
arguments we may bound the probability for the event £2 by 


0(n3/2) ^ 

£=l 


min{s„,£) 

E + ■■■+'nk>tn) 

k=l 


which also converges to zero. This concludes the proof. 


□ 


Proof of Theorem 6.10. It suffices to show that there are constants C,c,N >0 such that for all 
n > N and h > y/n we have that 

P(H(Xn) >h)< C(exp(—c/i^/n) +exp(—ch)). 

For any hxpoint v G Tn set i{v) = sum index u ranging over all ancestors 

of the vertex v in the plane tree . Note that we are summing up the outdegrees in the tree Tn 
and not in the tree tI. Moreover, for any vertex y G 7)i let Vy denote its closest fixpoint, that is, 
Uy = y if y is a fixpoint and otherwise Vy is the unique vertex with y G Fn{vy). If y has height 
^Y„{y) > 2/1 then h.Y^{vy) > h or dY^{u,Vy) > h. Thus either Hy„(7)/) > h ot there exists a 
fixpoint V G Til such that the dY„-diameter Dy of the subspace {u} U F{v) is greater than or 
equal to h. 

Let s > r > 0 be constants. Given h > y/n let £‘^^ denote the event that H(X„) > h. It 
follows that C £q U £i U £2 kl £!f with the events £l given as follows. £q is the event that 
there exists a fixpoint v G Til with Dy > h. £^ is the event that H(7)f) > rh. £2 is the event 
that H(7)f) < rh and i{v) > sh for some fixpoint v G TI. T 3 is the event that i{v) < sh for all 
fixpoints V e Til and Hy„(7)/) > h. 

We are going to show that if we choose r and s sufficiently small, then each of these events is 
sufficiently unlikely. By Lemma 6.11 we have that 

P(4) = 

for some 0 < 7 < 1. Hence there are constants Co,co> 0 such that IP(To) — Go exp(—co/i^/n) if 
h <n and IP(To ) — Go exp(—coh) if /i > n. 
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In order to bound the probability for the event note that the tree Tn conditioned on having 
size ^ is distributed like conditioned on having size That is, it is identically distributed to 
a ^-Galton-Watson tree conditioned on having size t which we denote by Hence 

n 

p(4) < Y,n\rj\ = £)m{r;) > rh). 

1=1 

By Inequality (2.2) there exist constants Ci,ci > 0 that do not depend on n or /i such that for 
all 1 < £ < n we have the tail bound 

P(H(7£0 > fh) < Cl exp(—cir^/i^/£) < Ci exp(—cir^/i^/n). 

In particular, it holds that < Ci exp(—for all n and h. 

We proceed to bound the probability for the event Let Z„ denote the sampler 
conditioned on having size n and let % C U^o Sym(^ 7 ^)[A:] denote the set of ^-/e-symmetries 
S = ((T, a), a) having the property that there exists a hxpoint vertex vs in T with the property 
that £s '■= ^t{vs) < rh and Ylu ^ sh with the sum-index u ranging over all ancestors of 

the vertex v in the tree T. By Equation 6.5 we may bound the probability for the event £2 by 

P(4) = P(Z„ e%) = = {S,vs)). 

Sen 

Let T] denote the outdegree of the root in the sampler TZ (i)(p). By the assumptions on the 

cycle index sum Zj^<^ it follows that p has finite exponential moments. Note that the outdegrees 
along the spine of TZ p) are distributed like independent copies of p. It follows that 

mm{n,rh) 

F{£ 2 ) = 0{n^Y ^ F{pi +... + pe> sh) = 0{n^/‘^)F{pi +... + pirhi> sh). 

e=i 

By the deviation inequality in Lemma 5.3 it follows that there are constants c, A > 0 such that 
the above quantity is bounded by a constant multiple of exp(5/2 log(n) -f crh — Xsh). We assumed 
that h > ^yn, hence if we choose r sufficiently small depending only on s, c and A, it follows that 
there are constants C 2 , C 2 > 0 such that ) < C 2 exp(— 02 / 1 ). 

It remains to treat the event T 3 . By assumption, for any fixpoint v G we have that the 
height hY„(u) is bounded by £(v) many independent copies of a random variable x having finite 
exponential moments. Thus 

¥{£!}) < nF{xi + . ■ • + X[sh\ > h) 

with (xj)igN a family of independent copies of x- By the deviation inequality in Lemma 5.3 there 
are constants c, A > 0 such that this quantity is bounded by a constant multiple of exp (log (n) -|- 
[shjc —Ah). We assumed that h > y/n, hence we may bound this by exp(/i(log(n)/-y/n-|-sc — A)). 
If s is sufficiently small, then it follows that there are constants C 3 ,C 3 > 0 such that P(T 3 ) < 
C 3 exp(—C 3 / 1 ) for all n and h > y/n. 

Thus there exist constants C, c > 0 with 

F(£‘^^) < ^P(Tf) < C(exp(-c/iVn) +exp(-ch)) 
i 

for all n and h > y/n. This concludes the proof. □ 

7.3. Proofs concerning the applications. 

7.3.1. Random weighted graphs. As argued in Section 6.4, Theorems 6.12, 6.13 and 6.14 are 
special cases of the more general results we established in Sections 6.2 and 6.3. 
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7.3.2. Random front-rooted k-dimensional trees. We start with global geometric properties, as 
some intermediate results in there will also be useful in the study of the local properties. 

Proof of Theorem 6.18. By Theorem 4.1 it follows that the largest /C°-component of the random 
front-rooted /c-tree K„ has size n -|- Op(l). Hence Lemma 6.19 readily implies that converges 
toward the CRT after rescaling by the same factor as for K°. □ 

Proof of Lemma 6.19. The random front-rooted fc-tree K° corresponds to the random enriched 
tree for TZ = SEQj^j o SET. Hence our framework applies. 

We first show a tail-bound for the diameter. By the bijection discussed in Section 6.1.3, we 
know that the distance between any vertex in (7^,/3n) and its offspring is always 1 , as the two 
vertices are also joined by an edge in the corresponding /c-tree. Let ^/n < x < n be given. If 
H(K°) > X, then it follows that H(7^ > x). (Here we dehne the height with respect to the vertex 
that corresponds to the root of Tn-) Hence H( 7 j^^) > x /2 or |E(u)| > x /2 for some v G tJ, . Let 
us denote these events by £i and £ 2 - Lemma 6.11 states that there are constants Ci.,ci >0 such 
that uniformly for all n and non-negative x 

(7.24) P(max(|/„(u)| -h |En(t’)|) > x) < exp(-cix). 

veTf 

As we assumed that ^/n < x < n, it follows that 

^(£ 2 ) < exp(—cix/ 2 ) < C 2 exp(—C 2 X^/n) 

with the constants ( 72,02 > 0 not depending on n or x. As for the event £ 1 , Lemma 2.2 implies 
that there are constants ( 73 , C 3 > 0 that do not depend on n or x such that 

n 

nsi) = = i)m{rj) > x/2 1 ir/i = £) 

i=l 

n 

<CsY,n\rJ\=^)^M-c3xy£) 

£=l 

< (73exp(-C3X^/£). 

Thus, there are constants ( 74 , C 4 > 0 such that 

P(H(K°) > x) < P(Ti) + ¥{£ 2 ) < (74 exp(—C 4 X^/n) 

for all n and all ^/n < x < n. It is clear that, by possibly adjusting the constants involved, such 
an inequality also holds for all x > 0. This verifies the exponential tail-bound for the diameter of 
the fc-tree K°. 

It remains to establish the scaling limit. Inequality (7.24) implies that with high probability 
all vertices v G Tn have the property that the number of vertices in the forest F{v) is at most 
0(log(n)). This implies that the (pointed) Gromov-Hausdorff distance between the /c-tree K° 
and the subspace corresponding to the vertices of the fixpoint tree T^ is with high probability at 
most O(logn). Consequently, it suffices to show that there is a constant > 0 such that this 
subspace rescaled by akl\/n converges toward the continuum random tree. 

Let 

( k 

i=l 

We are going to show that there are exponents t > 0 and 1/2 < s < 1 such that with high 
probability 

(7.26) \dK°{u,v) -bkd^f{u,v)\ < ii.j-/(u, u)^-k log* n 

for all u, u G Til. This suffices to complete the proof. Indeed, it follows that with high probability 
(7.27) dGH((r/, dKo), (r/, bud^f)) < J9{Tir + log* n 
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By Lemma 6.11 we know that 


(7.28) 


V(i+E[c])v[e] 


’ o fz: ^l~n 


2y/n 


iTe,drJ. 


in the (pointed) Gromov-Hausdorff sense. In particular, D{tJ) = Op(-v/n), and hence it follows 
from Equation (7.27) that 


(7.29) dGR{{%{, n-i/2^Ko), {%{, hkn-^^^d^^)) ^ 0. 

Equations (7.28) and (7.29) then readily imply that the subspace of the fc-tree K°, that corresponds 
to Tn , converges toward the CRT after rescaling the metric by Ck / y/n with 

_ v/(l + E[C])V[^] 

""" 2bk 

It follows that 

(K°,Cfcn-i/ 2 ^Ko) A(re,drJ- 

Hence Inequality (7.26) is sufficient to complete the proof. 

Let u,v £ tJ be arbitrary vertices and let x £ Tn denote their youngest common ancestor. 
Let o denote the root of tI■ Then any shortest path in K° from o to u, or o to v, or u to u 
contains at least one vertex with c^ko- distance at most 1 from x. Thus the expression 

\dK°{u,v) - {d^<_o{o,u) + dK°^{o,v) - 2 dK°(o,x))| 

is bounded by a fixed constant that does not depend on u, v ot x. Thus, in order to show 
Inequality (7.26), it suffices to show that for a sufficiently small but fixed constant c > 0 and it 
holds with high probability that 

(7.30) |dK°(o,u) - 6 fc(i,/(o,u)| < c{d^f{o,vy + log* n) 
for all V £TJ . 

To this end, let denote the sampler (p) conditioned on having size n. That is, 

Zn is the symmetry corresponding to the Sym(77)-enriched tree {Tn,f3n)- Consider the set 
Ti C U^o [k] of . 4 . 7 ^-symmetries S = ((T, a), (t) having the property that the there exists 

a fixpoint vertex vs in T with the property that is ■= h'r(u 5 ) > log*(n) but the corresponding 
distance h*(u 5 ) in the /c-tree corresponding to {T,a) satisfies 


|h*('f^s) - his\ > cis- 

By Lemma 6.3 the probability for the sampler TZj\<^{p) to have size n is 0(n“^/^). It follows 
from Equation 6.5 that 

(7.31) P(Z„ £%) = 0{ry/y Y,nVZ^,^„(p)^{S,vs))<0{r?l^) Y. w. 

S^l-L \o^ n<l<n 

with Pi denoting the probability that the fc-tree distance di between the root vertex vq and the 
tip Vi of the spine ... ^Vi in TZ {p) satisfies 

'^'R. 

\de - bki\ > cf. 


We are going to bound the pi to show that the bound in (7.31) converges to zero. In order to 
simplify the calculations, let u denote any fixed vertex of the ^-place-holder root-front of the 
/c-tree corresponding to FZ (eg){p). Let d'^ denote the /c-tree distance from u to v^. Thus 

\de — < 1 


for all i and it suffices to study the deviation of from bki- We are going to exploit properties of 
the bijection in Section 6.1.3 to take a Markov chain approach. Consider the set Mq consisting of 
all k unlabelled vertices. The vertex vi is incident to vq and to a /c — 1-element subset Sq C Mq. 
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By the construction in Lemma 6.4 each {k — l)-element subset of Mq is equally likely. The 
distance of vi to u is given by 

d'i = 1 + min d{u, v) 
v£So 

with d{-, •) denoting the k-tiee distance. This follows from the fact that MoUjuo} is a (/c+l)-clique. 
Setting Ml = {t^o} U Sq, the distance of V 2 to u is again given by 

d '2 = I + min d{u, v) 

v&Si 

with Si C Ml denoting the {k — l)-element subset of Mi that is incident with V 2 - Here each 
{k — l)-element subset of Mi is (conditionally) equally likely. We may continue this construction, 
yielding sequences Sq, , Si-i and Mq, . .., Mi, such that for all 0 < z < £ — 1 it holds that 

(7.32) d[:i = 1 + mind(u, u) and Mj+i = {ui} U Si, 

v&Si 

and such that, conditioned on Mi, the subset Si gets drawn uniformly at random among the 
{k — l)-element subsets of Mj. Note that for all v £ Mi it holds that d{u, v) = d[ or d{u, v) = d[ — l. 
For all 0 < i < £ we let 1 < Xi < k denote the number of vertices in Mj with d{u, v) = d[ — 1. 
Equation (7.32) implies that for all 0 < i < .£ — 1 

,/ _ i ^i+i < k 

+ Xi+i = k' 

As dg = 1 and Xq = 1, it holds that 


(7.33) 


e 

4 = 1 + 

i=l 


Recall that given Mi, the set Si gets drawn uniformly at random from the {k — l)-element subsets 
of Mi. Thus, {Xi)i is Markov chain with the transition probabilities pij = P(Xn+i = j \ Xn = i) 
given by the matrix 


P — {pij)i,j 



k-2 



k-3 

k 


1 

k 


\ 




V 


k-1 1 

k k 

1 




Here we use the convention, that empty spaces in a matrix denote zero entries. The stationary 
distribution is given by 


^ — {T^i)l<i<k — ^ 

2^i=l i 

This chain is clearly irreducible. However, unless A: = 2, it is not reversible. As the multiplicative 
symmetrization P^P is irreducible, we may apply Lemma 5.2 to obtain that there are constants 
a,b > 0 such that for all sufficiently small e > 0 and all £ > 1 it holds that 



^{\d'i — 1 — TTfc^l > e£) < aexp(—b€^£). 


As TTfc = bk, follows that for any 1/2 < s < 1 we may choose f > 0 large enough such that the 
bound in (7.31) tends to zero as n becomes large. 

Thus, it holds with high probability that all vertices v £ with d^f{o,v) > log*n satisfy 

' n 

(7.34) \dK<^{o,v) - bkd..j-f{o,v)\ <cd.^f{o,vy. 

This readily verifies Equation (7.30) and hence completes the proof. □ 
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Proof of Theorem 6.16. Let kn = o{y/n) be a given sequence. The random front-rooted unlabelled 
k-tiee K„ may be viewed as a Gibbs partition. Theorem 4.1 ensures that K„ exhibits a giant 
component, and that the small fragments converge in total variation toward a Boltzmann limit. 
Thus, it suffices to show that 

(7.35) dTv(^A:JK°),yfcJK°))^0 

as n becomes large. Set hn = ^b^kn + = o{y/n) with bk defined in Equation (7.25). 

Theorem 6.5 ensures that 

dTY{{Tn,l3n)^^->, (.^(oo)^ ^(oo))</^„>) ^ 

as n becomes large. By Equation (7.34) we know that with high probability the trimmed 
tree {Tn, already contains all information required to determine the fe^-neighbourhood 

14„(K°). This verifies Equation (7.35) and hence completes the proof. □ 

Proof of Theorem 6.17. Let k^ = o{y/n) be a given sequence. Theorem 4.1 ensures that 
exhibits a giant component, which is distributed like K(r„)° for some random size = n-|-Op(l). 
A uniformly at random selected vertex of lies with high probability in this giant component and 
outside of its root-front. Conditioned on this event, the random vertex is uniformly distributed 
among the non-root vertices of the large component. Note that every path from a vertex of the 
giant component to a vertex of a smaller component must pass through the root-front. Thus, it 
suffices to show that if v* is a random vertex of {Tn,/3n), then 

dTv(f4„(K°,u*),Vfc„(K°))^0, 

and with high probability 14^(K°,u*) contains no vertex of the root-front of K°. Set hn = 
2bkkn + = o{^/n) with b^ defined in Equation (7.25). By Theorem 6.8 it follows that 

dTv((Hf;,„],u*),(H[,„],u*))^0. 

and that with high probability the vertex v* has height > hn. By Equation (7.34) it 

follows that with high probability it holds that hKo (u*) > kn and that (H|^ j, v*) already contains 
all information necessary to determine 14^(K°,u*). This completes the proof. □ 

7.3.3. Simply generated Polya trees. We argued in Section 6.6 how Theorems 6.20, 6.21 and 6.22 
follow from the results on random 77-enriched trees of Sections 6.2 and 6.3. 
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